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PREFACE 


This volume has a threefold purpose; to explain the physical concepts 
of quantum mechanics, to describe the mathematical formalism, and to 
present illustrative examples of both the ideas and the methods. The 
book is intended to serve as a text at the graduate level and also as a 
reference book. It is assumed that the reader is reasonably familiar 
with atomic structure, classical mechanics, and differential equations. 
In addition, he should have had some contact with electromagnetic theory 
and, for the latter part of the book, with the special theory of relativity. 

The author believes that the analytical methods employed in the book 
will satisfy most theoretical physicists even though no attempt is made 
to achieve mathematical rigor. For example, there is little or no dis- 
cussion of the justification for the interchange of sum, derivative, and 
integral operations, or for the use of the 5 function. On the other hand, 
the physical reasons for the nature of the results obtained are investigated 
wherever possible. 

Problems are given at the end of each chapter. They are often used 
to illustrate or amplify points discussed in the text. Original theoretical 
papers are referred to throughout the book; the list is representative 
rather than exhaustive. Experimental results are, for the most part, 
quoted without reference, since the large amount of documentation 
required for an adequate survey seems out of place in a book on theoretical 
physics. Several other books on quantum mechanics and related sub- 
jects are referred to for more detailed discussions of particular topics. 

The scope of this volume is best outlined if the book is divided into 
three parts. The first three chapters constitute an introduction to 
quantum mechanics, in which the physical concepts arc discussed and 
the Schrodinger wave formalism is established. The detailed treatment 
of the wave function (Chap. Ill) may be omitted in a first reading. Th(^ 
next eight chapters comprise the central part of the book. This part 
presents exact solutions of the wave ecpuition for both energy-level and 
collision problems, the I leisenl)erg matrix formalism and transformation 
theory, approximation methods, radiation i.lieory, and soim^ ajDplications 
to atomic systems. Since the first eleven chapters correspond to a 
typical one-year graduate course, it seemed desirable to include a semi- 
classical treatment of electromagnetic radiation in the central part of 
the book (Chap. X) even though some of the results are obtained again in 
Chap. XIV. The last part of the book corresponds to a short course in 
what is often called advanced quantum mechanics. It consists of rela- 
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cle theory and an introduction to quantized field theory and 
xectrodynamics. 

all this book was written while the author was at the Uni- 
oi Pennsylvania, and he gratefully acknowledges the continued 
agement of Prof. G. P. Harnwell. He is also indebted to Drs. E. H. 
and S. Pasternack for helpful criticism of the early chapters, to 
W. Miller and L. Spruch for their careful reading of most of the manu- 
script, and to Profs. F, Bloch, E-. F. Christy, and W. W. Hansen for 
valuable comments on certain sections. It is a particular pleasure for 
the author to thank Prof. R. Berber for many discussions of both the 
conceptual and formal aspects of quantum mechanics that took place 
during the last eleven years. 

Finally, the author wishes to acknowledge his indebtedness to Prof. 
J. R. Oppenheimer for his introduction to several of the ideas and exam- 
ples that appear in the book. Indeed, the writing of this book in its 
present form owes much to the period from 1937 to 1940, which the 
author spent in association with Professor Oppenheimer. 


Stanford, California 
JunSj 1948 


Leonard I. Schiff 
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CHAPTER I 


THE PHYSICAL BASIS OF QUANTUM MECHANICS 

At the present stage of human knowledge, quantum mechanics can 
be regarded as the fundamental theory of atomic phenomena. The 
experimental data on which it is based are derived from physical events 
that lie almost entirely beyond the range of direct human perception. 
It is not surprising, therefore, that the theory embodies physical con- 
cepts that are foreign to common daily experience. These concepts did 
not appear in the historical development of quantum mechanics, how- 
ever, until a quite complete mathematical formalism had been evolved. 
The need for quantitative comparison with observation, which is the 
ultimate test of any physical theory, in this case led first to the formalism 
and only later to its interpretation in physical terms. 

It seems desirable in introducing the subject of quantum mechanics to 
depart from the historical order and preface the mathematical develop- 
ment with a discussion of the physical concepts. In this chapter we first 
review briefly the experimental background and the ideas of the old 
quantum theory, then discuss the newer physical concepts of uncertainty 
and complementarity, and finally lay the groundwork for the formalism 
that will be developed in its most familiar form in Chap. IL No attempt 
will be made to deduce the structure of the formalism from the funda- 
mental experiments; we shall try to make the theoretical development 
seem plausible rather than unique. The justification for the theory, 
then, will rest on the agreement between deductions made from it and 
experiments, and on the simplicity (in piinciple more than in practice) 
and consistency of the formalism. 

1. EXPERIMENTAL BACKGROUND 

Experimental physics prior to 1900 had <lemonstrated the existenc^e 
of a wide variety of phenomena, which for the most part were believed 
to be explicable in terms of what we now call classical theoretical physics. 
The motions of mechanical objects were successfully discussed in terms 
of Newton’s equations on both celestial and terrestrial scales. Appli- 
cation of this theory to molecular motions produced useful results in the 
kinetic theory of gases, and the discovery of the electron by J. J. Thom- 
son in 1897 consisted in showing that it behaved like a Newtonian particle. 

1 
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The wave nature of light had been strongly suggested by the diffraction 
experiments of Young in 1803., and was put on a firmer foundation 
by MaxwelFs discovery in 1864 of the connection between optical and 
electrical phenomena. 

Inadequacy of Classical Physics. The difficulties in the understand- 
ing of experimental results that remained at the beginning of this century 
were largely concerned with the development of a suitable atomic model 
and with the late discoveries of X rays and radioactivity. However, 
there were also difiSculties associated with phenomena that should have 
been understood but actually were not: such things as the spectral dis- 
tribution of thermal radiation from a black body, the low-temperature 
specific heats of solids, and the appearance of only 5 degrees of freedom 
in the motion of a free diatomic molecule at ordinary temperatures. 

The beginning of an understanding of the second class of difficulties 
was made by Planck in 1900, when he was able to explain the black-body 
spectrum in terms of the assumed emission and absorption of electro- 
magnetic radiation in discrete quanta, each of which contains an amount 
of energy E that is equal to the frequency of the radiation v multiplied 
by a universal constant h (called Planck^ s constant): 


E hv (1.1) 

This quantum idea was later used by Einstein in accounting for some of 
the experimental observations on the photoelectric effect. In this way 
the dual character of electromagnetic radiation became established: 
it sometimes behaves like a wave motion, and sometimes like a stream of 
corpuscular quanta. 

At about this time, the existence of discrete values for the measurable 
parameters of atomic systems (not only of electromagnetic radiation) 
became apparent through Einstein's and Debye's theories of the specific 
heats of solids, Ritz's classification of spectral lines, the experiment of 
Franck and Hertz on the discrete energy losses of electrons on collision 
with atoms, and (somewhat later) the experiment of Stern and Gerlach, 
which showed that the component of the magnetic moment of an atom 
along an external magnetic field has discrete values. 

Summary of Principal Experiments and Inferences. The theoreti(ial 
physics of the first quarter of this century thus contained two important 
inferences, obtained from the experiments and their interpretations, that 
had not existed in 1900: the dual character of electromagnetic radiation, 
and the existence of discrete values for physical quantities. The relations 
between the principal experimental conclusions and the theoretical 
inferences are shown schematically in Table 1 ; for a more detailed dis- 
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cussion and a bibliography, reference should be made to a book on atomic 
physics. 1 


Table 1. Relations between Experimental Interpretations and Theoretical 

Inferences 


Diffraction (Young 1803, Laue 1912) 

Black-body radiation (Planck 1900) 
Photoelectric effect (Einstein 1904) 

Compton effect (1923) 

Combination principle (Ritz-Rydberg 1908) J 
Specific heats (Einstein 1907, Debye 1912) 
Franck-Hertz experiment (1913) 
Stern-Gerlach experiment (1922) 


{ Electromagnetic 
waves 

{ Electromagnetic 
quanta 

{ Discrete values 
for physical 
quantities 


A third theoretical inference appeared in 1924 with the suggestion by 
de Broglie that matter also has a dual (particle-like and wave-like) 
character; he assumed that the relation between the momentum p of the 
particle and the length X of the corresponding wave is^ 

X . \ (1.2) 

Up to that time all the evidence had indicated that matter was composed 
of discrete Newtonian particles; in particular, sharp tracks of charged 
particles such as electrons and helium nuclei had been observed in expan- 
sion cloud chambers like that invented by C. T. R. Wilson in 1911. 
Shortly after this, however, Davisson and Germer (1927) and G. P. 
Thomson (1928) independently observed the diffraction of electrons by 
crystals, and thus confirmed de Broglie’s principal supposition. 


2. THE OLD QUANTUM THEORY 

What is now called the old quantum theory^ was initiated by the work 
of Planck on black-body radiation, and carried farther by Einstein and 
Debye. However, it was only after Rutherford’s discovery in 1911 that 
an atom consists of a small, massive, positively charged nucleus sur- 
rounded by ele(dTons, that the theory could be applied to a quantitative 
description of atoms. 

1 Se(^, for oxjimplo, E. K. Riclitinyor and E. H. Koimard, “Introduction to Modern 
Physics” (M(^Oraw-Hill, New York, 1942); M. Born, “Atomic Physics” (Blackio, 
Glasgow, 1940). 

2 Equation (1.2) is also valid for light (iiianta, as may bo soon by dividing both 
sides of Eq. (1.1) by the velocity of light r; for a dirocted l)oam of light p = E/c and 
X = c/v. 

2 For a more detailed discussion than is presented in this section, see the books cited 
above, and L. Pauling and 10. B. Wilson, Jr., “Introduction to (Quantum Mechanics,” 
Chap. II (McGraw-Hill, N(‘w York, 1935). 
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Bohr-Sommerfeld Quantization Rules. The first step in this direc- 
tion was taken by Bohr in 1913, when he made two postulates concerning 
the electronic or extranuclear structure of an atom. The first of these 
was that an atomic system can exist in particular stationary or quantized 
states, each of which corresponds to a definite energy of the system. 
Transitions from one stationary state to another are accompanied by 
the gain or loss, as the case may be, of an amount of energy equal to the 
energy difference between the two states; the energy gained or lost 
appears as a quantum of electromagnetic radiation, or as internal or 
kinetic energy of another system. The second postulate (in agreement 
with that of Planck and Einstein) was that a radiation quantum has a 
frequency equal to its energy divided by Planck^s constant h. 

These two postulates by themselves provided some insight into the 
Ritz combination principle and the Franck-Hertz experiment. To obtain 
specific results for hydrogen, Bohr proposed a simple rule for the selection 
of those circular orbits which are to constitute stationary states: the 
angular momentum must be an integral multiple of 11/2^. A more 
general quantization rule was discovered independently by W. Wilson 
(1915) and by Sommerfeld (1916), thus making possible the application 
of Bohr^s postulates to a wider variety of atomic systems. This rule is 
applicable to Hamiltonian systems in which the coordinates are cyclic 
variables, and states that the integral of each canonical momentum with 
respect to its coordinate over a cycle of its motion must be an integral 
multiple of h. The rule was applied with considerable success to the 
computation of the fine structure of hydrogen, the spectra of diatomic 
molecules, and other problems. 

Practical Difficulties- The old quantum theory encountered practical 
diflSculties in several different respects. It could not be applied to 
aperiodic systems, it provided only a qualitative and incomplete treat- 
ment of 'the intensities of spectral lines, and it did not give a satisfactory 
account of the dispersion of light. Moreover, improvements in experi- 
mental techniques soon showed that there were problems, such as the 
rotational spectra of some diatomic molecules, to which the theory gave 
unambiguous but incorrect answers. 

The correspondence principle was introduced by Bohr in 1923 in an 
effort to make use of the classical theory as a limiting case to infer some 
properties of atomic systems, especially the intensities of spectral lines. 
Although much was achieved in this way, it was clear in the early 1920’s 
that the quantum theory as it then existed was unsatisfactory. 

Conceptual Difficulties. Quite apart from the practical difficulties 
outlined above, the old quantum theory failed to give a conceptually 
satisfactory account of the fundamental phenomena. It was difficult to 
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understand why the electrostatic interaction between a hydrogen nucleus 
and an electron should be effective when the ability of the accelerated 
electron to emit electromagnetic radiation disappeared in a stationary 
state. The mechanism of emission and absorption of radiation in transi- 
tions between stationary states was obscure. The quantization rules 
were arbitrary even when they were most effective. And the assumption 
of a dual character for light (particle-like on emission and absorption 
and wave-like in transit) seemed to be self-contradictory. 

In order to illustrate the conceptual difficulties and the way in which 
they are dealt with by the new quantum mechanics, we consider in some 
detail a simple diffraction experiment, which is illustrated schematically 
in Fig. 1. A light source S illuminates a diaphragm A in which two slits 
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Fju. 1. A cliff ruction oxporitnent in which liKht from S pjihhos tiiroiiKh tlio two hUIs in A to 
form M tiilf ruction pattern at B. 


are cut. A diffraction pattern appears at a pliol,osensitivo screen B, 
and the ejected photoelectrons are most numerous at the diffraction 
peaks. Here we have the radiation behaving as a Avavc during its passage 
from source through slits to screen, but behaving as a stream of light 
(fuanta or photons when it ejects electrons from B. We now know that a 
similar experiment could be set up with matter instead of radiation. 
The diffraction pattern of elec^trons scattered from a crystal (analogous 
to the slits in A) may be detected as a distribution of electron tracks in a 
Wilson cloud chamber (analogous to the screen B), so that the wave and 
particle aspects of matter appear in the same experiment. 

In the situation illustrated in Fig. 1, we might at first suppose that 
the dilTraction pattern is due to an interference between different photons 
passing through the two slits, thus explaining the observations entirely 
in terms of the particle picture. That this is not a sufficient explanation 
may be shown by decreasing the intensity of the light until an average of 
only one photon at a time is in transit between source and screen. The 
diffraction pattern still appears as the distribution of the large number of 
photons accumulated over a sufficiently long tim(\ 1'hus ^ve must con- 
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elude that diffraction is a statistical property of a single photon, and does 
not involve an interaction between photons. From the point of view of 
the particle picture, we may then ask how it is that a stream of inde- 
pendent photons, each of which presumably can go through only one of 
the slits, can produce a diffraction pattern that appeal’s only when both 
slits are open. Or to put the question in another way, how can the 
presence of a slit through which a photon does not go prevent that photon 
from reaching a part of the screen it would be likely to reach if that slit 
were closed? 

Quantum-mechanical Viewpoint. In this question is implicit the 
assumption that the photon actually does go through a particular one 
of the two slits. This assumption is natural from the point of view of the 
classical theory or the old quantum theory since these theories regard a 
photon or other particle as having a definite and determinable position 
at each instant of time. The quantum mechanics, however, discards this 
assumption, and asserts instead that the position of a photon has meaning 
only when the experiment includes a position determination. Moreover, 
this part of the experiment wiU affect the remainder of the experiment 
and cannot be considered separately. Thus from the point of view of 
quantum mechanics, the question asked in the last paragraph is without 
meaning, since it assumes that the photon goes through a particular one 
of the two slits (thus making it possible to close the other slit) when thei’e 
is no provision in the experiment for determining through whicdi slit the 
photon actually goes. 

The quantum mechanics resolves the situation by telling us tliat tlie 
diffraction pattern is destroyed if a sufficiently careful attempt is made 
to determine through which slit each photon passes (see Sec. 4). We 
must then be prepared to forego the customary mental pic-tiire of a 
photon (or an electron) as a classical particle that has at each instant of 
time a position that can be determined without damage to diifraction 
patterns of the type discussed here. Thus classical causality, whic.h 
requires that the motion of a particle at any time bo uniquely detcu-min- 
able from its motion at an earlier time, must also be abandoned. The 
new theory that is forced upon us in this way is so successful in ot,h(M' 
respects as well that, at the present state of knowledge, we must n'gai-d 
such classically incomplete descriptions as a fundamental property of 
nature. 


3. TTNCERTAINTY AND COMPLEMENTARITY 

Before presenting a more quantitative discussion of the diffraction 
experiment outlined in Sec. 2, we consider two pnnciples that expr(‘SH 
in qualitative terms the physical content of the theory of quantum 
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mechanics. We restrict ourselves here to a discussion of their meaning, 
and give arguments for their validity in Sec. 4. 

Uncertainty Principle. The first of these is the uncertainty 'principle, 
developed by Pfeisenberg^ in 1927. According to this principle, it is 
impossible to specify precisely and simultaneously the values of both 
members of particular pairs of physical variables that describe the behav- 
ior of an atomic system. The members of these pairs of variables are 
canonically conjugate to each other in the Hamiltonian sense: a rectangu- 
lar coordinate x of a particle and the corresponding component of momen- 
tum pxj a component Jg of angular momentum of a particle and its 
angular position in the perpendicular (xy) plane, the energy E of a, par- 
ticle and the time t at which it is measured, etc. Put more quantitatively, 
the uncertainty principle states that the order of magnitude of the 
product of the uncertainties in the knowledge of the two variables must 
be at least Planck^s constant h divided by 27r {h = h/2ir = 1.064 X 10"^^ 
erg-second), 2 so that 


A;r • Apx > h 

(3.1) 

A<l> • AJs > h 

(3.2) 

At ‘ AE >h 

(3.3) 


The relation (3.1) means that a component of the momentum of a 
partich^ cannot l)e precisely spec-ified without our loss of all knowledge 
of the corrc^spondiiig component of its position at that time, that a particle 
cannot be pi-ecisely hxjalized in a particular direction without our loss of 
all knowledge of its momentum component in that direction, and that in 
intermediate cases the product of the uncertainties of the simultaneously 
measurable values of corresponding position and momentum components 
is at least of the order of magnitude of h. Similarly, Eq. (3.2) means, 
for example, that the precise measurement of the angular position of a 
particle in an orbit carries with it the loss at that time of all knowledge 
of the (U)mp()n(uit of angular momentum perpendicular to the plane of 
the orbit. Efpiation (3.3) means that an energy determination that 
has an acxuirac.y AE must occupy at least a time inteiwal At h/AE] 
thus if a system inaiTd.ains a pai’ticadar state of motion not long(U’ than 
a tim(i A/, the (Mi(‘rgy of th(^ systcun in that state is uncertain by at least 
the amount AE ^ h/At, mw.o At is the longest time interval available for 
the energy d(‘t(u*minati()n. The smallness of //. makes the uncertainty 
principle of interest primarily in connection with systems of atomic size. 

^ VV. TT(‘is(Mib(‘.rK, Zmln. f. PhnsiJc, 43, 172 (1027). 

2 It. T. UirKc'., Prv. Mod. Plnis. 13, 233 (19-11). 
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As we shall see in Sec. 12, the uncertainty principle may be obtained 
directly from the mathematical formulation of the theory, and this is 
actually the way in which it was first obtained by Heisenberg. 

Complementarity Principle. In order to understand the implications 
of the uncertainty principle in more physical terms, Bohr^ introduced the 
complementarity principle in 1928. This principle states that atomic 
phenomena cannot be described with the completeness demanded by 
classical dynamics; some of the elements that complement each other to 
make up a complete classical description are actually mutually exclusive, 
and these complementary elements are all necessary for the description of 
various aspects of the phenomena. From the point of view of the experi- 
menter, the complementarity principle asserts that the physical apparatus 
available to him has such properties that more precise measurements 
than those indicated by the uncertainty principle cannot be made. 

This is not to be regarded as a deficiency of the experimenter or of his 
techniques. It is rather a law of nature that, whenever an attempt is 
made to measure precisely one of the pair of canonical variables, the other 
is changed by an amount that cannot be too closely calculated without 
interfering with the primary attempt. This is fundamentally different 
from the classical situation, in which a measurement also disturbs the 
system that is under observation, but the amount of the disturbance can 
be calculated and taken into account. Thus the complementarity 
principle typifies the fundamental limitations on the classical concept 
that the behavior of atomic systems can be described independently 
of the means by which they are observed. 

Limitations on Experiment In the atomic field, we must choose 
between various experimental arrangements, each designed to measure 
the two members of a pair of canonical variables with different degrees 
of precision that are compatible with the uncertainty relations. In 
particular, there are two extreme arrangements, each of which measures 
one member of the pair with great precision. According to classical 
theory, these extreme experimental arrangements complement each other; 
the results of both may be obtained at once and are necessary to supply 
a complete classical description of the system. In quantum mechanic's, 
however, the extreme complementary experiments are mutually exclusive 
and cannot be performed together. 

It is in this sense that the classical concept of causality disappears in 
the atomic field. There is causality in so far as the quantum laws that 
describe the behavior of atoms are perfectly definite; there is not, how- 
ever, a causal relationship between successive configurations of an 

^ N. Bohr, Naiurny 121, 680 (1928); Atomic Theory and the Description of 
Nature/^ especially Part II (Cambridge, London, 1934) Rev., 48, 696 (1935). 
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atomic system when we attempt to describe these configurations in 
classical terms. 


4. DISCUSSION OF MEASUREMENT 


In this section we consider two fairly typical measurement experi- 
ments from the point of view of the new quantum mechanics. The first 
is the diffraction experiment that was brought up in Sec, 2, and the 
second is an experiment designed to detennine the position and momen- 
tum of a particle by an optical method. 

Analysis of Diffraction Experiment We first analyze the diffraction 
experiment of Sec. 2 from the point of view of the complementarity 
principle, assuming that the uncertainty principle is valid. Two con- 
trasting arrangements, which would complement each other classically, 



Fig. 2. Tho cxporinionlul urmn^Ginoiit of I, modifier I by tlio luMiLioii of photon 
indioatorH C. 


arc considered here. One of these is illustrated in Fig. 1. Since it is 
assumed that the distance from A to B is large compared to the distance 
between the two slits, and this in turn is largo (U)mpared to tho wave 
length of the light, the distribution of int(‘nsity in the diffraction pattern 
determines to good approximation the angular distribul-ion of the photons 
leaving the slits in A, and hence deiiMTuiiK'S th(' distribution of the 
y components of momentum of the photons Ixyond A. sec.ond 

arrangement, shown in Fig. 2, determiiurs througli whicjh of the two slits 
each photon passes, and hence provides informai/ion on the y coordinates 
of the photons. 

In the second arrangement each plioton iH^gist(‘rs itself as it piisses 
through a slit by bouncing off one of a niiml)er of indicators C placed 
close to A, and giving up to it a ?/ compon(^nt of momemtum that may be 
uncertain by the amount Apy. If we do not want the resultant diffraction 
pattern of many such photons to be dc^stroyed l)y these events, the 
uncertainty in py for a particular photon produ(U‘d by its encounter witli 
an indicator must be substantially smaller than would be recpiired to 
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throw the photon from a maximum of the diffraction pattern at B into a 
neighboring minimum. With a photon of momentum this requires 
that 

« ev^ (4.1) 


For the simple case in which R'^ a the angle 6 is known experi- 
mentally (or from the theory of wave optics) to be given by 



(4.2) 


in terms of the optical wave length X and the distance a between the slits. 
At the same time, we have not learned through which slit this photon 
passed unless the uncertainty Ay of the y position of the indicator that 
recoiled is less than half the distance between sHts. 

Ay < ia (4.3) 

We now use some of the experiments discussed in section 1 [see liq. 
(1.2)] to establish a relationship between wave length and momentum 
that holds equally well for photons and material particles, 

X = A (4.4) 

It is then apparent from Eqs. (4.1) through (4.4) that the requirement 
that we be able to determine through which slit each photon passes 
without destroying the diffraction pattern at B is equivalent to tlu^ 
requirement that 

Ay-Apy<^lh (4.5) 

for each indicator that is used. Since Eq. (4.5) is in disagreement with 
the uncertainty relation (3.1), we may conclude that it is impossible 
to determine through which shts the photons pass without destroying the 
diffraction pattern. 

Discussion of Diffraction Experiment. The situation just analyzed 
shows the intimate connection between the theoretical principles of 
uncertainty and complementarity find the experimental observations 
related to localization and diffraction. It provides an explicit demon- 
stration of the validity of the complementarity principle (represented in 
this case by the choice between the mutually exclusive but (Oassic^ally 
complementary experiments for observing the diffraction and foi' loc^aliz- 
ing the photon) when taken in conjunction with the cxpei*imonl)aIly 
observable properties of matter and radiation. For it shows tliat no 
fundamental difficulty need be encountered with the photon picture so 
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long as we do not insist on the degree of detail in describing the situation 
that is entailed by classical concepts. 

It is, of course, still necessary to ascribe unfamiliar properties to 
the photons in order to explain the experimental observations; the fore- 
going discussion does not show how individual photons can interfere with 
themselves to produce the diffraction pattern,^ Nor on the other hand 
does it show how an electromagnetic wave can eject photoelectrons from 
the screen. 2 Such demonstrations lie beyond the scope of the qualitative 
discussion of this chapter, and require the use of the mathematical 
formalism of quantum mechanics. However, analysis of the diffraction 
experiment from the point of view of quantum mechanics removes the 
difficulty encountered in Sec. 2; the diffrac- 
tion pattern disappear whenever a success- 
ful attempt is made to determine through 
which slit each photon passes. 

Localization Experiment. We now con- 
sider a particular example of the validity of 
the uiKHU’tainty principle, making use of 
a position-momentum determination that is 
typical of a numbeu* of somewhat similar ex- 
periments that have been discussed in con- 
nection with measununents on particles and 
radiation fields.** Wo shall consider here 
the a(^curacy with wliic.h the x components 
of the posil.ion and inomeJitum vectors of a 
mat(U*ial parlhde can l)e deterinined at the 
same time l)y ol)serving tlu^ particle through 
a rather idc'alized mi(u-()S(*.op(^ by means of s(^atter(Ml light. 

Thi'. best resolving powta* of lensL shown in Fig. 3 is known (again 
either (experimentally or from the theory of wave optics) to provide an 
ac(!ura(;y 

A.r (4*()) 

sin € 

in a position chehermination, where X is the wave length of the radiation 
that ent(M*s the !(‘ns, and € is the half angle subl.ended at the particle P 

1 C^li.aptcr VI shows lh(^ (H|iuvalcn(^o of th(^ i]uH)ry of Olmj). TI and th(^ Koncral 
ciiiiintiiin-inoc,lianir.M.l theory of partirles, so far as matU^r is (concerned. 

2 CMijipU'r XTV shows how thi^ theory of tho cdeotromagnctic field can ho modified 
to include qiiantuni elTiud-s. 

3 Sec, for example, W, Ileiseiiherg, “Tho Physical Principles of the C^ianturn 
Theory,” Ohaps. 11, III (University of (Miieago Press, Chieago, 1030). 



Kia. 3. All experiment for the 
localization of a particle P by 
means of one of tho scattered 
(|uanta Q, whhOi is focused by tho 
Ions L to form an imago on tho 
screen S. 
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by the lens. For simplicity, we consider the oiw m whicli rmly < mo 
the light quanta Q is scattered onto the screen . ( » u to 

aperture of the lens, the precise direction in whic I - k Ocl 

into the lens is not known. Then since Eq. (4.4) states ul^ k ni< 
turn of the photon after it is scattered is h/X, tlio uiuMutain > in t-iio 
component of its momentuni is approximately (A/X) 

The X components of the momenta of the pliotoii tu id I u* 
can be accurately known before the scattering takers }>la<*t\ siuco ( Iiorti 
no need then to know the a: components of their posilJons. AIm), if c>\7x* 
position measurement refers to the displacement of the. purlirh* 
respect to the microscope, there is no reason why th(^ tol-al luonnuit n ni of 
the sj'stem (particle, photon, and microscope) need 1h> allcu’t'd thiritiij; f lio 
scattering. Then the uncertainty Ap® in the x ooinpoiuuil ol llic rm 
turn of the particle after the scattering is equal to llu' corrospcmi 
uncertainty for the photon. 


Ap; 


h . 

, - sin €. 

A 


I 1. V) 


If we combine Eq. (4.6) with Eq. (4.7), we seo jii.st iii'icr ( he nr:t 1 i, oil- 
ing process, 

Ax-Ap^r^h C'J.H) 

is the best that we can do for the particle. Tims a realistic iusmu ti f 
of the properties of the radiation gives a result in agrccincnl wiilj 
uncertainty relation (3.1) for the particle. 

This experiment may also be considered from tint jioiiil of view t>f 
the complementarity principle. The complemenl.ury an angcmi-u ( s 1 1 i fTox- 
in the choice of wave length of the observed radiation: snllicictilly .snuxll 
\ permits an accurate determination of the po.sition of die parliclt' jiiwh 
after the scattering process, and large X of its inoiiKMitiiin. 


6. WAVE PACKETS IN SPACE AND TIME 

The relation (4.4) between momentum and wave Icngl li wJiicdx i.M 

known experimentally to be valid for both photons an.) parli,-],.. 
deSrib^f possible to use concentrated lnmel.,.s of wa,v.*.M 

ideas, we shall consider a wave amplitude or wave \ Inil, depeii.N c >ii 

ThisUntitvV .iTl V 

have three ba^c properties. Krst, it can interfere wiili itself so -i i f 
can account for the results of diffraction experime s ^ i -/■ - 

large m magnitude where the particle or nhn+nn • m i*' ’^''fond, it. 

elsewhere A.iid third d/ ivill Via j hkoly to ])(• ;oi(l snuill 
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such quanta. This last is an essential requirement in view of the conclu- 
sion of Sec. 2 that a single quantum of matter or radiation interferes 
with itself rather than with other quanta. In this section we shall con- 
fine ourselves to a qualitative discussion of the one-dimensional case, in 
which the wave function ^ depends only on x and t, and leave the quan- 
titative development for Chap. II. 

Space Packets. A typical form for a concentrated bunch of waves, 
which we shall call a wave packet^ is show in Fig. 4o, where is 

plotted against x for a particular time L The average wave length 



Xo and the approximate extension A.r of the packet arc indicated in the 
diagram. The Pouner integral analysis^ of with respect to x is now of 
interest since it shows how yj/ may be built up out of continuous harmonic 
waves of various lengths. This is indicated in Fig. 46, in which the 
Fourier transform of ^ is plotted schematically against the propagation 
number k = %r/\. 

It can be shown by standard mathematical methods that 

it i ^ (5.1) 

where A/r is the approximate spread in propagfitiou numl)er associated 
with the packet. If now we correlate wave length and momentum tis in 
Mq. (4.4), we see that the spread Ak corresponds to a momentum spread 

Ap = A = -^Ak = h • Ak (6.2) 

Combination of Eq. (5.1) with Eq. (5.2) gives 

Ax* Ap> h (5.3) 

^ S(‘o, for oxainplo, L. A. Pipes, “Applied Miithonuilies for Engineers and Physi- 
cists,” Chap, ni (M(5Graw-Hill, New York, 1046). 
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which agrees with the uncertainty relation (3.1). Thus the uncertainty 
principle for position and momentum of a quantum of matter or radiation 
follows directly from the wave-packet description and Eq. (4.4). 

Time Packets. In analogous fashion, we may examine the depend- 
ence of on the time t for a point x that is typical of the pa(^ket, and 
obtain a time Fourier transform that shows how can be built up out of 
continuous harmonic waves of various frequencies v. In this case the 
relation between the spread in time of yp and the spread in frequency of 
the time Fourier transform of ^ is 

Af • Av ^ ^ (5.4) 

Equation (5.4) may be brought into correspondence with the uncer- 
tainty principle by associating the energy J? of a quantum with the fre- 
quency of the wave that represents it in a manner similar to the 
association of momentum with wave length given by Eq. (4.4). We 
shall make this connection through Eq. (1.1): 

E = hv (5.5) 

which may be infen'ed in the case of photons from the experimental 
discussion of Sec. 1. Combination of Eq. (5.4) with Eq. (5.5) then gives 
the uncertainty relation (3.3). 

The assumption that Eq. (5.5) is valid for matter as well as for radia- 
tion may he made plausible by computing the group velocity^ of a wave 
packet that represents a nonrelativistic particle of mass m, kinetic energy 
and momentum p for which X and p are given by Eqs. (4.4) and (5.5), 
respectively. The group velocity, which is the velocity of the (joni.er of 
the packet, is equal to 

dv _ dE _ d{p^/2m) _ p 
rf(l/X) ^ dp ^ m 

in agreement with the chissical expression for the velocity. This sliows 
that with Eq. (5.5), the wave-packet devseription of the motion of a 
particle agrees with the classical description when the cirmimstunces arc 
such that we can ignore the size and internal structure of the packet. 

Wave Formalism. We see then that quanta of matter or radiation 
can be represented in agreement with the uncertainty principle by wave 
packets that may be superposed to produce interference and whoso 
magnitudes give a measure of likelihood of location, provided tliat use is 
made of the experimentally inferred relations (4.4) and (5.5). It is then 
possible to set up a quantitative formalism based on the matlumal-ical 
^ M. Born, op, cit.j pp. 85, 295. 
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analysis of wave motion. This will be done for matter in Chap. II, using 
the physical principles outlined in this chapter as a guide, and requiring 
always that the result of any calculation reduce to the result of the cor- 
responding classical calculation in the appropriate limit. This require- 
ment is a way of stating Bohr’s correspondence principle, which was 
mentioned in Sec. 2. At the present time, when a reasonably complete 
quantum theory exists, the correspondence principle is mainly of interest 
in assuring that the formalism has the proper classical limit, although it 
may also be of heuristic value in performing new calculations or extending 
the boundaries of the theory. 

It might at first be thought that the exclusive use of a wave formalism 
for the description of matter in the next four chapters would conflict 
with the observed particle-wave duality discussed in Sec. 1 and hence 
disagree 'with the complementarity principle. This is not the case, how- 
ever, for the formalism actually provides an understanding of all of the 
measurable properties of matter, including, for example, the production of 
particle tracks in a cloud chamber. Thus it will be shown in Sec. 30 
that, if a single material particle is represented by a wave function of 
definite momentum and hence completely undetermined position, the 
probability of ionization of two or more molecules of the cloud-chamber 
gas is negligibly small unless the molecules lie very neai’ly along a line 
parallel to the momentum vector. 

It must be emphasized that these remarks are true only if a wave 
function of the typo described in this section is always interpreted as 
representing just one particle of matter, and not the statistical distribu- 
tion of a number of particles. When we wish to describe more than one 
particle, wc must make use of a wave function that depends on the coordi- 
nates of all of them. The analogous analytical description of light quanta, 
which Avill bo taken up quantitatively in Chap. XIV, makes use of a 
somewhat dilTcrcnt approach. This is mainly because photons (unlike 
particles as we consider them here) can be emitted or absorbed through 
intcra(!lj()n with matter, so that their number is not fixed. Thus a photon 
wave func^tion would have to despond on a variable number of parameters, 
and it is desirable to avoid such a situation. 

Problems 

1. Clivo a briof (l(^H(;riptioa of oanli of tho oxporiiiKaiis rcfc^rrod to in Tahl(‘ 1, 
toKothor with tlioir intorprotations. 

2. DoHciriho hrinfly tho Davissoii-Gcrmor and ThorriMon cxporiinents, and the uho 
of the Wilson cloud chamber for the observation of particle trac^ks. 

3. A h(\am of silver atoms for a Stern-Clerlach experiment is produced by eollinuit- 
inp; atoms that vapo^i/(^ from silver held in a funiaee at 1200°C. Tf the beam travels 
1 m(it(?r, use the uncertainty relation to find the order of magnitude of the smallest 
spot tliat can be obtained at the detector. 
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4. Show that if a component of angular momentum of the electron of a hydrogen 
atom is known to be within 6 per cent error, its angular orbital position in the plane 
perpendicular to that component cannot be specified at all. 

6 . Use Eq. (3.2) to show qualitatively that if one component of ang\ilar momen- 
tum of a particle is precisely known, the two perpendicular angular-momentum com- 
ponents cannot be known accurately at the same time. 

6. A 1-ounce rifle bullet takes 0.5 second to reach its target. Regarding the bullet 
as a mass point, and neglecting effects of air resistance and earth motion, find the 
order of magnitude of the spread of successive shots at the target under optimum 
conditions of aiming and firing. 

7. A perfectly elastic ping-pong ball is dropped in vacuum from a height equal to 
ten times its radius onto a perfectly elastic fixed sphere of the same radius. Neglecting 
effects due to earth motion, estimate the largest number of bounces against the fixed 
sphere that the ball can be expected to make under optimum conditions of release. 

8. A beam of monoenergetio electrons is used to excite a particular level of an 
atom in a Eranck-Hertz experiment. If this level is of short duration, owing to radia- 
tion back to the ground state, show that the inelasticaUy scattered electrons that 
have lost energy to produce the excited level will not all be expected to have the same 
final energy. If the excited level lasts about 10“^® second, what is the order of magni- 
tude of the electron energy spread, measured in electron-volts? 

9. Discuss any connections you can think of that exist between the three uncer- 
tainty relations (3.1), (3.2), and (3.3). 
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THE SCHRODINGER WAVE EQUATION 

This and the next several chapters are concerned with the non- 
relativistic motion of a particle in a force field that can be represented by 
a potential energy. A quantitative description of the motion in terms of 
a differential equation, the Schrodmger wave equation^ is developed in this 
chapter, and applied to a simple one-dimensional problem. Various 
assumptions have to be made as regards the structure of the wave equa- 
tion, the boundary and continuity conditions on its solutions, and the 
physical meaning of these solutions. These assumptions are given a high 
degi'ee of plausibility in this chapter and the next by relating them to 
experimental results, mainly those that deal with the diffraction of 
material particles and with the existence of a classical limit to the quan- 
tum mechanics. However, no attempt is made to derive the formalism 
uniquely from a consideration of the experiments. The definitive test 
of the theory must, of course, be its internal consistency and the success 
with which deductions from it agree with particular experimental meas- 
urements; some examples will be worked out in Chaps. IV and V. 

6. DEVELOPMENT OF THE WAVE EQUATION 

The form of the Schrodinger wave equation is obtained in this section 
by generalizing the properties of the wave amplitude of Sg(!. 5. The 
remainder of this chapter presents a discussion of some of the properties 
of the equation and its solutions. 

Traveling Harmonic Waves. The first task is to develop in more 
(juantitative fashion the properties of the onoMlimonsional wave function 
^(.^,0 that was discussed qualitatively in 5. Tt was shown there that 
for a continuous traveling harmonic wave, the wave length and momcm- 
tum arc I’olatcd by hki. (4.4) and the energy and fre<iuency by Eq. (5.6). 
We rewrite these two equations in terms of the universal constant 
h = hlTiir: 

p = hk, ^ = Y (61) 

E = hcj, 0) = 2Trv (b.2) 

A wave function that represents a particle of completely unde- 

termined position traveling in the positive x direction with precisely 

17 
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known momentum p and kinetic energy E, would then be expec^tcjd u > 
have one of the forms 


cos {kx - sin {kx ~ >-« ) 

or some linear combination of them. This follows from ililTriic. tiim ox y"><>r5 
ments like those of Davisson and Germer and of Thomson (soo . \ \ 
and from the requirement that a wave packet of approximnic'ly (Im 
propagation number k and angular frequency w has a group vohxn ty (u j tijil 
to that of a classical free particle of momentum p and energy K |s(U’‘ 10<p 
(5.6)]. 

Need for a Wave Equation. In order to go beyond th(^ very sinii Jl(^st 
problem of a continuous harmonic wave, it is veiy desirable to .‘in 

equation of which both the harmonic waves and more (iOinpli<^ai-cxl wjivc's 
are solutions. An example from a more familiar fi(dd of pliysi(\s should 
help to clarify this point. In the case of three-dimensional sound 
in a gas, it is possible to obtain a solution of the problem of iln^ sejitttudng; 
of BOimd by a rigid sphere by superposing plane harmoni(^ wa^^es t nu'diug; 
in various directions. But it is far simpler to solve the dillVrcnit.uiI 
equation for sound waves directly in spherical polar eoordiiia-tc's. ir tii(* 
temperature of the gas changes from point to point, no progn'ss oin In* 
made in the general case without such a dilferentiul oiniiition. 'J'lic <M)r- 
rect underlying equation for sound waves can bo found from direct <m)H- 
sideration of the mechanical properties of the gns. While (his is iici( 
the case with the equation of which the wave funcd.ions of See. fu-e s< dii- 


tions, it is no less imperative to find the form of the (uiiinl ion. 'I'lie ihhuI 
for this becomes more evident when the wave fumdion is l,o df'serilic 1 lu* 
motion of a particle under the influence of extorniil forc^es; (his si (mil ion 
turns out to be analogous to the propagation of sound \\ !i\-es in jin 
inhomogeneous gas. We shall, therefore, find an rM|ual.ion for \f/, iiiul 
having found it, shall regard it as a more fumlanKuital aKrilnile of ( lio 


wave functions than the harmonic forms (0.3). 

The equation must have two basic propertiivs. h'irsi., i(, nuisl, l>(> 
linear, in order that solutions of it can be superj)osed (,o produce inliT- 
ference effects (in the three-dimensional case) and (,o permi(. (Iii> <-on- 
struction of wave packets. And second, the c()effieien(,s ..f ( he ,.,,ual ion 
must involve only constants such as h and the; mass and eli!irg(' of 1 lie 
particle, and not the parameters of a particular kind of niolion of I lu* 
particle (momentum, energy, propagation numl.(>r, n.nd fns,uenf.v). 
The reason for the latter requirement is that we shall wan(, (o leave op«>ii 
the possibility of superposing solutions that belong (,„ diUVrenl. values of 

cannot be a solution of an ecination that involves tin, iinrainel,..-s in i i h 
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structure. Since differential equations are the easiest to handle, it is 
worth while to try this type first, and it turns out that the requirements 
can be met by a differential equation. 

With all these considerations in mind, we look first at the most familiar 
one-dimensional wave equation, that which describes the motion of 
transverse waves on a string or plane sound waves in a gas : 


dt^ ~ dx^ 


(6.4) 


where y is the square of the wave velocity. Substitution of the forms 
(6.3) into Eq. (6.4) shows that each of the four harmonic solutions, and 
hence any linear combination of them, satisfies this differential equation, 
if and only if we put 

^ A-2 p2 4^2 

where m is the mass of the particle that is to be described by Eq, (6.4). 
Because of the structure of Eq. (6.5) it is apparent that the coefficient y 
that appears in Eq. (0.4) involves the parameters of the motion (E or p ) ; 
we therefore discard this differential equation. 

The One-dimensional Wave Equation. In looking further for a suit- 
able equation, it is helpful to note that differentiation with respect to x 
of wave functions like those of (6.3) has the general effect of multiplica- 
tion of the function by k (and sometimes also interchanging sine and 
cosine), wliilc differentiation with respect to t has the general effect of 
multiplication by oj. Then the relation E = p^/2m, which is equivalent 
to the relation w = hk-f27n, suggests that the differential equation for 
u^hich we an^ looking (‘ontains a first derivative with respect to t and a 
second derivative with respect to x. 




( 6 . 6 ) 


Substitution shows that the first two of the wave functions (6.3) are not 
solutions of l<]q. ((>.()), but that either of the last two maybe (but not both 
at once) if the constant y is suitably chosen. In particular, if we choose 

fo) ihE ih wv 

•» - F< ■ "F " a 

then the third of the wave functions (0.3) satisfies Eq. (6,6). Moreover, 
the value of y given by l^jq. (6.7) involves only the constants h and m. 

We are thus led to the one-dimensional form of the Schrodinger^ wave 

1 E. S(‘hr6(linp;(;r, .4rm. d. Phy»ik, 79, 3(51, 480 (1926); 81, 100 (1926). The present 
treatment is somewhat different from that originally given by Schrodinger. 
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equation for a free particle of mass m, which from Eqs. (6.6) and (6.7) 
may be written 


d^!/ __ A® 


( 6 . 8 ) 


The particular form in which Eq. (6.8) is written is significant in so far 
as its harmonic solution, the third of the wave functions (6.3), makes 
the left side Eyj/ and the right side The fact that the solution 

is complex is not in itself a defect of the formalism. We shall 
have to be certain that all predicted results of possible physical observa- 
tions are expressible in terms of real numbers, and this will supply a 
condition on the detailed interpretation of yp. 

Extension to Three Dimensions. The foregoing one-dimensional 
treatment is readily extended to three dimensions. It is natural to 
rewrite Eq. (6.1) as 

p = Ak, * = |k| = Y (6.9) 


where k is called the propagation vector. Similarly, the third of the wave 
functions (6.3) becomes 

exp [i(k ' r — cot)] (6.10) 

where r is the position vector for the particle. Then by an obvious 
extension of the argument that led up to Eq. (6.8), it is seen that the 
three-dimensional Schrodinger equation for a free particle that is repre- 
sented by the wave fimction ^(r,i) is 


dyp A® „ , 

dt 2m 


( 6 . 11 ) 


A comparison of Eqs. (6.9), (6.10), (6.11), and the classical encu'gy 
equation 



( 6 . 12 ) 


suggests that, at least for a free particle, the energy and momentum can 
be represented by differential operators that act on the wave funcjtion \p. 

a 

E-^ih—y p — > — ih grad (6.13) 

The development of Secs. 7, 8, 10, and 11 will show that these are also 
valid representations when the particle is not free. 

Inclusion of Forces. The next problem is to extend the free-particle 
wave equation (6.11) so that it includes the effects of external forces that 
may act on the particle. We shall assume for the present that these 
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forces are of such a nature (electrostatic, gravitational, possibly nuclear) 
that they can be combined into a single force F that is derivable from a 
potential energy 7. 

F(r,i) = - grad F(r,0 (6.14) 

Just os the classical relation between energy and momentum is used above 
to infer the structure of Eq. (6.11), so it is desirable now to start from the 
corresponding classical relation that includes external forces. This is 
simply expressed in terms of the potential energy 

^ + 7(r,i) (6.15) 

where E is now the total energy, and the first and second terms on the 
right side of Eq. (6.15) are the kinetic and potential energies of the 
particle, respectively. 

Since V does not depend on p or E, Eqs. (6.15) and (6.13) suggest that 
Eq. (6.11) be generalized into 

+ (6.16) 

This is the Schrodinger wave equation that describes the motion of a 
particle of mass m in a force field given by Eq. (6.14). While the intro- 
duction of Eq. (6.1(5) cannot claim as high a degree of plausibility as the 
derivation of the frce-particle equation (6.11), the further discussion 
of the next section should make it more convincing. It is, of course, the 
agreement of solutions of Eq. (6.16) with experiment in particular cases 
that eventually demonstrates the validity and usefulness of this wave 
equation. 

7. INTERPRETATION OF THE WAVE FUNCTION 

The wave function ^(r,i), which is a solution of the wave equation 
((5.16), is now assumed to provide a quantum-mechanically complete 
description of the behavior of a particle of mass m with the potential 
energy 7(r,0, and hence is analogous to the classical trajectory r(^). 
Thus far, the only interpretative guide available to us is that the wave 
function be large where the particle is likely to be and small elsewhere. 
This has to be supplemented with more detailed statements that enable 
us to get out of ^ the maximum amount of information permitted by 
nature, as was discussed in Sec. 3. As with the structure of the wave 
equation, the correctness of our interpretation of the wave function must 
in the end be judged by logical consistency and appeal to experimental 
results. 
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Statistical Interpretation. The phrase “likely to be ” in the preceding 
paragraph, together with the discussion of Sec. 3, indicates the need for 
interpreting ^|/ in statistical terms. We can imagine a very large number 
of identical, independent, nonoverlapping regions of space, each large 
enough to contain all the physically interesting features of the motion, 
in each of which the behavior of a particle with the potential energy 
7(r,0 is described by the same wave function in each case r is 

referred to the origin of the particular region. We then make the assump- 
tion that the numerical result of the measurement at a particular time t 
(in so far as the time at which the measurement is made can be specified) 
of any physically meaningful quantity, such as position, momentum, 
or energy, will in general not be the same for all the regions. Rather, 
there will be a distribution of these numbers that can be described by a 
probability function. 

For example, we have seen in Sec. 5 that the result of a position deter- 
mination is to be regarded as uncertain by an amount of the order of the 
linear dimensions of the Avave function. It is natural therefore to regard 
^ as a measure of the probability of finding a particle at a particular 
position Avith respect to the origin of its region. HoAvever, a probability 
must be real and nonnegative, whereas is complex in general. Wc 
therefore assume that the product of ^ and its complex conjugate ^ is 
the 'position probability density. 

P{U) = (7.1) 

This means that P(Tyt)dxdydz is to be the probability of finding a particle 
in its volume element dxdydz about its point r at the time I, Avhcii a large 
number of precise position measurements are made on independent 
particles each of Avhich is described by the one-particle Avave fuiKdiun 

Normalization of i/'. The probability of finding the particle soine- 
where in the region must be unity, so that Eq. (7.1) implies that the Avave 
function is normalized: 

;|^(r,0|^dr = 1 (7.2) 

where the integral extends over the entire region; here dr is the volume 
element dxdydz. If ^ is a wave packet of the type discussed in Sec. 5, 
the integral in Eq. (7.2) converges, and the numerical coefficient of yj/ may 
be adjusted so that the integral is unity; such normalization does not of 
course change the fact that ^ is a solution of Eq. (0.16), which is homo- 
geneous in \f/. There are, however, Avave functions like that given in 
Eq. (6.10) for AAffiich the integral in Eq. (7.2) does not converge if taken 
over an infinite volume. Such Avave functions require special (H)nsidora- 
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tion, and will be discussed fui-ther in Secs. 10 and 11. For the present, 
we may think of the region of space in which such a wave function is 
defined as being arbitrarily large, but finite; then the integral in Eq. (7.2) 
is over the finite volume of this region and converges, so that normaliza- 
tion is always possible. 

The coefficient of that normalizes it must be independent of the 
time in order that i may satisfy the wave equation (0.16). Thus if 
Eq. (7.2) is satisfied at one instant of time, the interpretation of as a 
position probability density recpiires that the normalization integral be 
independent of the time. That this is actually the ease may be shown by 
computing the time derivative of tlie integral of P over any fixed volume 
V. 

ih f 
ih f 

Here substitution has been made for d^fdt from Eq. (G.Ki), and for d^jz/dt 
from the complex conjugate of Eq. (G.IG). The last integral is obtained 
by partial intc^gration (use of Green’s theorem), where A is the bounding 
surface of th(i region of integration and [ ]„ denotes the component of 

the vecitor in brackets in the direction of the outward normal to the sur- 
face elemcmt (L \ . ' 

We define a vecf.or S(r,/), 

h 

== 'f' ”■ ( 7 . 3 ) 

in terms of \vhi(?h 

= - l^div Sdr = - ( 7 . 4 ) 

In the case of a wave packet, for \vhi(4i ^ vanisln^s at f»;reat distances and 
the normalization intefi;ral C()iiv(M‘j>;cs, the siii*fa(!e intefj;ral is evidently 
zero when V is the entire spae(\ For a wav(^ fuiu^/ion of the type given 
in Eq. (0.10), ^ (am be d(^fined in a finite region V so tliat it vanishes or 
has a periodic structure along the bounding surfacc^s (see Sec. 10). In all 
thes(^ (aises, it can be shown without difTuaili.y tliat tlu^ surface integral in 

^ It is (convenient to adopt an order of fjK^torH siieli thiit. if prcu^edcjs if fsee the dis- 
(uiHHion of I'kp (7.7) h('low|. 
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Eq- (7.4) is zero, so that the nonnalization integral Eq. (7.2) *'On- 

stant in time. , 

ProbabiUty Current Density. The derivation of Bq. (7.4) also shows 
that the differential relation 

+ div S(r,0 = 0 

is valid. This has the familiar form associated with the conBOi Vtition 
of flow of a fluid of density P and current density S, in which thort’^ urt^ no 
sources or sinks. It is thus reasonable to interpret S(r,i) given 1 >.V lui- 
(7.3) as a probability current density. This interpretation makc*H more 
plausible the identification of -’ih grad with the momentum in 
even when a force is present. For then (h/im) grad is the "v^tdocity 
operator, and it is apparent that 

S(r,^) = real part of grad ^ J 

While this interpretation of S is suggestive, it must be realizes 1 1 luit S 
is not susceptible to direct measurement in the sense in which /' II' 

would be misleading, for example, to say that S(r,0 is the avorafi;<* meas- 
ured particle flux at the point r and the time for a moasuron i<Mii ot 
average local flux implies simultaneous high-precision ineasurtuiac ‘fits of 
position and velocity (which is equivalent to momentum) and is tl K'nforc, 
inconsistent with the uncertainty relation (3.1). Ncvorthclt\sf-i, it is 
sometimes helpful to think of S as a flux vector, ospecnally wlu'n i(. 
depends only slightly or not at all on r, so that an accunito A,"<docijy 
determination can be made without impairing the iiS(d‘iilncHs of llic 
concept of flux. 

Expectation Value. The existence of the position probability < Icnsily 
i^(r,0 makes it possible to calculate what we shall call the 
value of the position vector of a particle, which is defined as t-hc» \'(‘ctor 
whose components are the weighted averages of the correspond i ii com- 
ponents of the position of the particle. The expectation valu<’‘ is (lu* 
mathematical expectation (in the sense of probability theory) for ihe 
result of a single measurement, or it is the average of the result s of a 
large number of measurements on independent systems of the dis- 

cussed at the beginning of this section. We write the expectation valuta 
of r as 

<r> = /rP(r,<)dT = /^(r,0r\£'(r,0rfr (7.r>) 

which is equivalent to the three equations 

{x) = ^^xpdT, {y) = (z) = Jt^z^rfT 
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where yp is normalized. The expectation value is a function only of the 
time, since p and P depend on t and the space coordinates have been 
integrated out. 

The expectation values of any other physically meaningful quantities 
can be found in a similar way if they are functions only of the particle 
coordinate r. Thus the expectation value of the potential energy is 

<F> = /7(r,i)P(r,0ci'r = F(r,i)^(r,0dr (7.6) 

A quantity such as momentum or energy must, however, be expressed in 
terms of r and t before a calculation of this type can be made. We assume 
that it is possible to use the differential-operator representations given in 
Eq. (6.13) for this purpose, and justify this assumption with the help 
of the corresponding probability functions in Sec. 10 (for the energy) 
and in Sec. 11 (for the momentum). The question immediately arises, 
however, as to how such differential operators are to be combined with 
the position probability density P. 

This question may be answered by imposing on the expectation values 
the reasonable requirement that 

in analogy with the classical energy equation (6.16). In terms of dif- 
ferential operators, this may be written 

It is apparent that Eq. (7.7) is consistent with the wave equation (6.16) 
only if the expectation value is defined in the general case with the 
operator acting on and multiplied on the left by We therefore 
obtain, for example, 

<P> = y j Sr&d Hr (7.8) 

Like Eq. (7.5), the second of Eqs. (7.8) is ecpiivalent to the three com- 
ponent eciuations 

{pj> = -ih j ^'^dr, (p„) = —ih j 

(p.) = -ih J 

Ehrenfest’s Theorem.^ It is reasonable to expect the motion of a 
wave packet to agree with the motion of the corresponding classical 
1 P. Ehrenfest, Zeita. f. Physik, 46, 455 (1027). 
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particle whenever the potential energy changes by a negligible amount 
over the dimensions of the packet. If we mean by the position 
and momentum'' vectors of the packet the weighted averages or expec- 
tation values of these quantities, we can show that the classical and 
quantum motions always agree. A component of the ^‘velocity" of 
the packet will be the time rate of change of the expectation value of 
that component of the position; since (:r) depends only on the time, and 
the X in the integrand of Eq. (7.5) is a variable of integration, this is 

= I = I ^xf^dr + j 


This may be simplified by substituting for the time derivatives of the 
wave function and its complex conjugate from Eq. (6.16) and canceling 
the V terms. 

- f {- 

~ ^ / ~ iV^^)XAf/]dT 

The second integi’al can be integrated by parts; 

j {y^)x^dT “ ” (grad ^) • grad {xyl/)dT + (.r^grad ^)nhx, dA 

where the integral of the normal component of x\[/ grad ^ over the infinite 
bounding surface A is zero because a wave packet t/' vanishes at great 
distances. A second partial integration, in which the surface integral 
again vanishes, results in 


Thus 


m J ^ dx 


( 7 . 9 ) 


Since (a;) is seen always to be a real number from the structure of Eq. 
(7.6), Eq. (7.9) shows quite incidentally that (p*) is real; this can also be 
shown from the second of Eqs. (7.8) when ^ represents a wave packet, 
by means of partial integration. 
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In similar fashion we can calculate the time rate of change of a com- 
ponent of the momentum” of the particle as 



again substituting from the wave equation and integrating twice by parts. 

Equations (7.9) and (7.10), together with their other components, are 
analogous to the classical equations of motion, 


(it "" in 


§ = - grad F 


They provide an example of the correspondence principle, since they 
show that a wave pa<^ket moves like a classical particle whenever the 
expectation value gives a good representation of the classical variable; 
this is usually the macu'oscopic limit in which the finite size and the internal 
structure of the packet can bo ignored. 


8. ENERGY EIGENFUNCTIONS 

The Schrodinger wave equation ((hlO) admits of considerable sim- 
plification when the potential enei’gy V{t) does not depend on the time. 
It is then possible to exprcjss its genei’al solution as a sum of products of 
functions of r and t separately. 

Separation of the Wave Equation, We consider a particular solution 
of Eq. (0.10) tliat can be written as a product: ^(r,i) = ?/.(r)/(/); a general 
solution (^an then bo written as a sum of siudi separated solutions. If we 
substitute into Eep (0.10) and divide througli by the product, we obtain 

+ (8.0 

Since the left side of Eep (8.1) depends only on t and the right side only 
on r, both sides must be ecpial to the same separation constant, which we 
call E. Then the equation f or / is readily integrated to give 

m 

m = cv' * 
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where C is an arbitrary constant, and the equation for u becomes 

[ - ^ + F(r)] «(r) = Eu{j) (8.2) 

Since Eq. (8.2) is homogeneous in tt, the constant C may be chosen to 
normalize u. Then a particular solution of the wave equation is 

^ iSt 

= u(r)e (8.3) 

Significance of the Separation Constant E. The time-derivative 
operator given in Eq. (6. 13) as a representation of the total energy may 
be applied to the f of Eq. (8.3) to give 

= (8.4) 

An equation of the type of Eq. (8.4) is called an eigenvalue equation; ^ 
is said to be an eigenfunction of the operator that appears on the left, 
and the multiplpng constant E that appears on the right is called the 
corresponding eigenvalue,'^ An energy eigenfunction, like the ^ in Eq. 
(8.3), is said to represent a stationary state of the particle, since 1^1^ is 
constant in time. 

Equation (8.2) is also an eigenvalue equation. It states that u (and 
hence also is an eigenfunction of the operator [— (h-/2m)V^ + F(r)] 
with the same eigenvalue E. It is, of course, to be expected that ^ is 
an eigenfimction of this operator if it is an eigenfunction of the time- 
derivative operator, since according to the wave equation (O.IG), the two 
operators are equivalent not only for separated functions of the form of 
Eq. (8.3) but also for more general solutions. 

We now anticipate the discussion of the physical significance of 
eigenfunctions and eigenvalues that will be presented in Chap. Ill, and 
assume that the energy eigenvalues E are the only possible results of 
precise measurements of the total energy of the particle. It is then of 
interest to inquire whether or not physically interesting solutions u(t) 
of Eq. (8.2) exist for all real values of E. An answer cannot be obtained 
imtil a specification of “physical interest” is found in terms of the bound- 
ary conditions that are imposed on w(r). This specification, and the 
general character of the energy eigenvalues that are associated with 
various types of potential energy function F(r), are considered in the 
remainder of this section. 

' The terms characteristic function and characteristic valice are often used in place of 
eigenfunction and eigenvalue. 
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Boundary ConditiGns at Great Distances. We have thus far encoun- 
tered two classes of wave functions: wave packets that are well localized 
and for which the normalization integral converges, and traveling 

harmonic waves like the function (6.10) that have a constant magnitude 
at great distances so that the normalization integral taken over an 
infinite volume diverges. The fii’st class may be interpreted as represent- 
ing particles that, if free, are initially well localized, or that are restrained 
to a particular region of space by external forces derived from the poten- 
tial energy T^Cr). The second class represents particles that are neither 
localized nor restrained, but travel through the region under consideration 
from one distant part of space to another; such wave functions will Be 
useful in describing the scattering of particles by a field of force. ^ In 
either case, the wave functions are bounded at great distances in all 
directions. 

Continuity Conditions- The time-independent wave equation (8.2) is 
a second-order linear differential equation in r. Thus so long as V{t) is 
finite, whether or not it is continuous, a knowledge of the wave function 
and its gradient along a suriace makes it possible to integrate the equa- 
tion to obtain the wave func.tion at any point. It is reasonable, therefore, 
to require that the wave function and its gradient be continuous, finite, 
and single-valued at every point in space, in order that a definite physical 
situation can be represented uniquely by a wave function. These 
requirements also have the conscquenc^c that the position probability 
density P(r) and the probability current density S(r), defined in Sec. 7, 
are finite and continuous everywhere. 

Boundary Conditions for Infinite Potential Energy. If F(r) is infinite 
anywhere, the appropriate boundary condition can be established by a 
limiting process that starts from a finite V and the above continuity 
conditions. 

Suppose, for example, that there is an infinite discontinuity in V 
across a continuous suriace, so that the potential energy is finite on one 
side of it and + oo on the other, and wc wish to determine the boundary 
conditions on u(r) and grad u at this surface. The essential features of 
the problem are retained if we replace the continuous sui-face by the 
plane that is tangemt to it at the point of interest, and the continuously 
changing potential energy on one side of the surface by a constant poten- 
tial, which can, without loss of generality, be chosen to be zero since any 
constant change in V is equivalent to an equal change in E. We choose 

^ Another possible class eonaists of wave funetions that become infinite at large 
distances; however, these are not of pliyaical interest, since wo have no reason to be 
concerned with particles for which the position probability dciifTity bec^oinos inde- 
finitely large in remote regions of space. 
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the origin of coordinates at the point of interest and the x axis perpendicu- 
lar to the tangent plane. 

The wave equation (8.2) then separates in the three space coordinates, 
and the dependence of w on 2 / and z is not affected by the discontinuity in 
y at the plane a; = 0. We wish therefore to solve the one-dimensional 
wave equation 

where V{x) =0 for a; < 0, y(a;) = Fo for a; > 0, and we eventually 
pass to the limit Fo ^ . If we assume that 0 ^ E < Vo, the general 

solutions of Eq. (8,5) are 

u(x) ^ A sin ax + B cos ax, x <0, a — + 

u(x) = + De^*, x > 0^ i3 = + 

The boundary condition that u be bounded at great distances requires 
that we set D = 0. Then the continuity of u at a; = 0 gives the relation 
B = Cy while the continuity of du/dx gives the relation a A = — jSC. 
Since p becomes infinite when Vo does, and the solution for a: < 0 must 
be finite, the second relation shows that C becomes zero as Vo-^ 00 , 
thus also making B zero; A is not determined from these relations, but 
might be fixed by normalization. 

Thus the boundary conditions at a surface at which there is an infinite 
potential step are that the wave function is zero and the component of 
the gradient of the wave function normal to the surface is not determined. 
The assumption above that E < Vo evidently not a restriction since 
Vo eventually becomes infinite. For E < 0^ the sine and cosine in the 
solution for a; < 0 are replaced by hyperbolic sine and cosine (which is 
permissible since the solution need hold only near a: == 0), with no change 
in the final result. It should be noted that both P and vanish as 
a: — > 0 from the negative side, so that they are continuous at a; = 0 even 
though duldx is not. 

A boundary surface of this type represents a perfectly rigid, impene- 
trable wall, since in the analogous classical situation a particle of any 
finite energy would have its x component of momentum reversed instan- 
taneously on contact ^vith the surface. 

Energy Eigenvalues in One Dimension. Energy eigenfunctions that 
represent particles that are restrained to a particular region of space by 
the potential energy (first class) are always characterized by discrete 
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eigenvalues, while eigenfunctions that do not vanish at great distances 
(second class) possess a continuous range of eigenvalues. This may- 
be seen qualitatively by considering the nature of the solutions of the one- 
dimensional wave equation (8.5). 

We shall suppose at first that Vix) becomes equal to some constant 
value, which may be taken to be zero, for sufficiently large positive and 
negative Xj and that E < 0. A classical particle with this total energy E 
cannot escape to infinity, and indeed c,an exist in the region only if E is 
greater than or eqtial to the smallest value Vuxm of T(.r). The permitted 
form of the wave function for \x\ large enough that T = 0 is evidently 
where /3 = +{—2mE/h^)K These two solutions for large positive 
and negative x can be extended in toward some intermediate point, say 
a; = 0, by making use of the wave equation and the continuity conditions. 
At this point, u can always be made continuous by choosing the arbitrary 
multiplying constants for the positive and negative x solutions appropri- 
ately. Then a little reflection shows that for an arbitrary value of S, 
they will not meet with the same value of diijdx. There may, however, 
be particular values of E for which both u and duldx are continuous at 
a: = 0. The conditions for this can be seen in the following way. 

In the regions in which E < d’^ufdx^ is positive, and so u is 

convex towards the x axis. Thus the two solutions that are continued in 
from ± 00 have opposite signs for their ratios of slope to value, 

I dll 
li d.v 

so long as they are in regions for whicdi E is always less than V. This is 
illustrated in Fig. bb for the potential shown in Fig. ba; both (dioices of the 
sign of u are shown for a: < 0. The points at wliic>h E = V(x) are (uilled 
the turning points {TP) of the classicail motion, sinc.e they are the limits of 
the motion of a classical particle of energy A', at whi(;h the particle tiirns 
around or reverses its motion. At these points d'hi/dx^ = 0, and u has 
zero curvature. 

Clearly what is needed to make the two solutions join smoothly is a 
region in which E > F(aO, in order that dhifd/x^ may be negatives and u 
be concave toward the x axis; this may permit tlu^ ratios of slopes to value 
to become c(iual. Figure be shows the two solutions (airricMl in until they 
meet, but for a somewhat small value of A\ so that when th(‘. //’s ai’c made 
the same at a: = 0, the slopes are not eciual (solid curv(is), or when the 
slopes arc made the same at x = 0, the 7/’s are not e(pial (dotUul (uirve on 
left and solid curve on nght). Figure bd shows a soinewhai. larger (less 
negative) value of A, and Fig. 5e a still larger value, ''riic valiK’s of E 
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and are indicated on the u axis, and the turning points are indicated 
on the X axis, in the last three cases. 

Discrete Energy Levels. We see then that an eigenfunction that 
satisfies the boundary and continuity conditions and that represents a 
particle bound by the potential energy V{x)y can exist for the particulai’ 





u(x) 




!tP X 

(e) 

■V • 
mm. 


Pig. 6. (a) Potential energy function and (b) solutions for large |a:| ; (c) and (e) show that 

either the wove function or its slope is discontinuous at a; ~ 0 for values of E that are 
smaller (more negative) and larger, respectively, than the energy eigenvalue shown in id). 

value of E illustrated in Fig. 5rf. In analogy mth the classical situation, 
a necessary condition that such an eigenfunction exist is that 7min < 0, 
in which case E lies between "Fmin and 0; as in the classical case, this con- 
dition is also sufficient in one dimension although it is not in three dimen- 
sions (see Prob. 10, Chap. IV, and Secs. 9 and 15). 

If the potential energy well illustrated in Fig. 5a is sufficiently broad 
or deep, there will exist another eigenfunction corresponding to a larger 
energy eigenvalue that is still negative. Figures 6a, 6, and c show a series 
of wave functions analogous to those shown in Figs. 5c, d, and c'for suc- 
cessively increasing (successively less negative) values of E; both signs 
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of « are Bho™ loi r < 0. Thus FigB. 5d and 6b show the eigenfunctions 
for the two lowest energy eigenvalues or energy levels of a particle boun^ 
by the po ,enl,uil well (.r). It :s easy to see by an extension, of the fore- 
going qualitative arguments that if there are any higher discrete energy 
levels, eaeh eigeidunetum has one more node than that corresponding to 
the next lowest eigenvalue. 

Ihus lor a iiotential energy that approaches a finite constant value as 
X > ± , there may bo a finite number of discrete energy levels, or in 





Fia. 0. Solutions for fliifncioiitly broad or deep potential and larger (less negative) values 
of E than tlio.so hIiowu in 5. E increases in going from (a) to (6) to (c) and is an 
oigouvnluo in (/>), whore tho wave function and its slope can both be continuous at a: =0. 

aomn c^ustis jin infinite* number (if V{x) falls off slowly enough for large 
l.r|), (leponding on l'(.r) and the mass of the particle. However, if 
V (a;) — > + CO as X — > :!: CO , an argument like that given above shows that 
there will always b(^ an infinite number of discrete energy levels; apart 
from arbitrary mult.i])lying constants there will be just one eigenfunction 
u{x) for oa(!li of 1-1 k‘S(\ 

Continuous Energy Eigenvalues. It is possible to find eigenfunctions 
that ()l>(*y the boundary and continuity conditions for all -energy eigen- 
valu(*s lhal. (‘XcumhI Uk* smaiku’ of the two numbers 7(+oo) and F(— <»). 
If, for (^xainpb*, llu* ])()Umtial energy has the form illustrated in Fig, 5a, 
then soluiions of 1\h) wave equation can be found for all positive values 
of E, This is be(!aus(^ (die solutions for large \x\ are of the form 

A sin Q!l:r| + B cos a\x\, a = + 



( 8 . 6 ) 
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and there is no reason why both terms should not be kept. Thus it is 
always possible to adjust the phase of each of the wave functions for large 
\x\ (which is equivalent to adjusting the ratios A/B for the solutions for 
large positive and negative x) so that they join together smoothly when 
continued in to a; == 0. 

The classical terms periodic (or multiply periodic) and aperiodic are 
sometimes used to designate the particle motions associated with discrete 
and continuous energy eigenvalues, respectively. 

Discrete and Continuous Eigenvalues in Three Dimensions. We 
shall assume without further discussion that all the foregoing results can 




Fig. 7. One-dimensional square well potential with (a) perfectly rigid walls and (6) finite 
potential step. 

be taken over in a natural way for the three-dimensional wave equation 
(8.2). We can expect that if F(r) — » + oo as r oo in all directions, 
there will be an infinite set of discrete energy levels extending to + oo . 
If V (r) is bounded as r — > co in some direction, there may be a finite or an 
infinite number of discrete levels, depending on the form of F. In this 
case, the discrete energy levels cannot exceed the smallest value that 
F(co) has in any direction. For values of E larger than this smallest 
F(oo), the energy eigenvalues ^cover a continuous range extending to 

9, ONE-DIMENSIONAL SQUARE WELL POTENTIAL 
As a simple explicit example of the calculation of discrete energy levels 
of a particle in quantum mechanics, we consider the one-dimensional 
motion of a particle that is restrained by reflecting walls that terminate a 
region of constant potential energy. Two simple types of potential 
energy are considered. Figure 7a shows a situation in which V{x) = 0 
for — a < a; < a, and V{x) = +00 for \x\ > a, corresponding to per- 
fectly rigid, impenetrable walls at the points x = ±a. In Fig. 7b the 
increase in potential energy at the walls is abrupt, but finite, so that 
V(x) = Fo for la:| > a; because of its appearance, this is often called a 
square well potential. The motion of a classical particle with total energy 
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E less than “Fo is the same for both these potentials; but as we shall see, 
the quantum-mechanical behavior is different. In general, an abrupt 
finite increase in potential energy at the boundaries of a region forces a 
particle toward the interior of the region. Such a potential may be 
thought of as a limiting case of a potential of the type shown in Fig, Sa, for 
which the force —dVIdx is always directed in toward a; = 0. The force 
associated with a square well potential is zero except at the boundaiies, 
so that the particle is acted on by no force except a sudden impulse 
directed toward the origin as it passes the points rr = ± a. 

Perfectly Rigid Walls. It was shown in Sec. 8 that the wave function 
must vanish at the points a; = ±a, when the potential energy has the 
form shown in Fig. 7a. From Eq. (8.6) the wave equation for la;| < a 
is simply 


ft'-* d^u ^ 


(9.1) 


which has the general solution 

u{x) — A sin OLX + B cos ax^ a = + (9.2) 

Application of the boundaiy conditions at x = ±a gives 

A sin aa + B (U)s aa = 0, 

— /i sin aa + B (!os aa = 0, 

from which wo obtain 

A sin aa = 0, U (h)S aa = 0 

Now we do not want both A and B to zero, since this would give 
the physi(^ally uninteresting solution u = 0 ovciry where. Also, we cannot 
make both sin aa and cos aa zero for a given vtilue of a or E. There arc 
then two possible chusses of solutions: For the first class 

A = 0 and cos aa = 0 

and for the sec.ond (dass 

/i = 0 and sin an = 0 

'rhus aa = n7r/2, where n is an odd integer' for llie first (dass and an 
even integer for the second edass. The two class('is of solutions and th(dr 
energy eigenvalues arc then 


u{x) = 13 cos 

Za 

. . . . riTTX 

u{x) = A sin 

2a 

Sind^ 


n odd 


II even 


in both cases 
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It is evident that n = 0 gives the physically uninteresting result u = 0, 
and that solutions for negative values of n are not linearly independent 
of those for positive n. The constants A and B can easily be chosen in 
each case so that the eigenfunctions u{x) are normalized. 

There is thus an infinite sequence of discrete energy levels that cor- 
respond to all positive integer values of the quantum number n. There 
is just one eigenfunction for each level, and the number of nodes of the 
wth eigenfunction that are within the potential well is n — 1. These 
results are in agreement with the discussion of Sec. 8. It is interesting to 
note that the order of magnitude of the lowest or ground-state energy 
level is in agreement with the uncertainty relation (3.1). The position 
unceiiiainty of order a implies a momentum uncertainty at least of order 
ft/a, which in turn implies a minimum kinetic energy of order ft2/ma\ 
Finite Potential Step. When the potential energy has the form shown 
in Fig. 76, it is necessary to supplement the general solution (9.2), which 
is still valid for |a;| < a since Eq. (9.1) is unaltered there, by a solution for 
|a;l > d. The wave equation in this region is 


~ H” XT + 1^0^ = Eu 
2m dx^ 


which has the general solution for E < Vo (bound states) 

Uix) = Ce-P- + ^ = + (9.3) 

The boundary conditions at x = ± « discussed in Sec. 8 require that we 
set D = 0 if Eq. (9.3) is to represent the solution for x > a, and C = 0 
if the solution is for x < —a. 


We now impose on the solutions (9.2) and (9.3) the requirements that 


u and du/dx be continuous at cr = ± a. 

A sin «a + J5 cos aa = a A cos aa — aB sin aa = — 

—A sin aa A- E cos aa = aA cos aa + aB sin aa = 

from which we obtain 

2A sin aa = (C — D)e“^“, 2aA cos aa = — D)e''^“ (9.4) 

2B cos aa = {C + D)c-^“, 2aB sin aa = /3(C + D)e-P^ (9.5) 

Unless 4=0 and C D, Eqs. (9.4) have as their consequence 

a cot aa = —/S (9.6) 

Similarly, unless B = 0 and C = — D, Eqs. (9.5) give 

a tan aa = P (9.7) 
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Now it is impossible for Eqs. (9.6) and (9.7) to be valid at once, since 
on elimination of ^ this would require that tan^ aa = — 1, which in turn 
would make a imaginary and jS negative, contrary to Eq. (9.3). Also, we 
do not want and D all to vanish. Thus the solutions may again 

be divided into two classes: For the first class 

A = 0, C = D, and a tan aa = 
and for the second class 

B = 0, C - — Z?, and a cot aa = 

Energy Levels. The energy levels are found from a numerical or 
graphical solution of Eqs. (9.6) and (9.7) with the definitions for a and jS 



Fig. 8. GraphioJil Holiitioii of Eq. (9.7) for three valuen of Vua®; vertical dashed lines are 
the firat two aMymijtotcH oi rj = £ tan £. 


given in Eqs. (9.2) and (9.3). A simple graphical method for effecting 
this solution is described horn, since it shows (piite clearly the way in 
which the number of discrete levels depends on Fo and a. We put 
f = aa, t? = jSa, whence Eep (9.7) becsomes { tan f with 


e + 


2mVoa^ 


Since f and t; are restricted to positive values, the energy levels may be 
found in this case from the intersections in the first quadrant of the curve 
of f tan g plotted against with the circle of known radius (2mFoa^/ft2)*. 
The construction is drawn in Fig. 8 for three values of Vua^; for each of 
the two smaller of these values, there is one solution of Eq. (9.7) and for 
the largest there are two. 
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Figure 9 is a similar construction for the solution of Eq. (9.6) in which 
the energy levels are obtained from the intersections of the same circles 
with the curve of — f cot f in the first quadrant. The smallest value of 
V ou* gives no solution, and the two larger values each give one. Thus the 
three increasing values of V oCt* give altogether one, two, and three energy 
levels, respectively. 

It is clear from Figs. 8 and 9 that for a given particle mass, the energy 
levels depend on the parameters of the potential energy through the com- 
bination Foffl*. For FoO^ between zero and there is just one 



^o. 0. Graphical solution of Eq. (0.0) for three values of Foa*; vertical dashed line is the 
first asymptote of i; = -{cot f. 


energy level of the first classj for V oot* between and four times this 

value, there is one energy level of each class, or two altogether. As Foa“ 
increases, energy levels appear successively, first of one class and then of 
the other. It is not difficult to see from Eq. (9.2) that when ordered 
according to increasing eigenvalues, the nth eigenfunction has n — 1 
nodes. 

Parity. It follows from the foregoing discussion that the eigenfunc- 
tions of the first class are even with respect to change in sign of x [u(— a;) 
= w(a;)], while the eigenfunctions of the second class are odd [M(-a:) 
= — w(a:)]. This division of the eigenfunctions into even and odd types 
is not accidental, and will now be shown to be a direct consequence of the 
fact that the potential energy function V(x) is symmetric about a; = 0. 
If we change the sign of x in the wave equation (8.6) 


tPu{jx) 
2m dx^ 


-f V{x)u{x) = Eu{x) 


(9.8) 


/ 
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d^u(—x) 
2m dx^ 


+ V(x)u(—x) = Eu^—x) 


Then u{x) and u(—x) are solutions of the same wave equation with 
the same eigenvalue E. Unless there are two or more linearly inde- 
pendent eigenfunctions that correspond to this energy level, these two 
solutions can differ only by a multiplicative constant: 

w(— a:) = eu{x). (9.9) 

Changing the sign of x in Eq. (9.9) gives u{x) = eui—'x). From these 
two equations it follows at once that 

= 1 or € = ±1 


Thus all such eigenfunctions of a symmetric potential are either even or 
odd with respect to changes of sign of x. Such wave functions are said 
to have even or odd ‘parity. 

If an eigenvalue has more than one linearly independent eigenfunc- 
tion, these eigenfunctions need not have a definite parity: they need not 
be even or odd. However, we v.iin easily see that linear combinations of 
such eigenf line (/ions can be found such that each has even or odd parity. 
Suppose that an eigi'iifuncd-ion ?i(ar) does not havc^ a definite parity. It 
can always be written 

u(x) = u,(x) + Ua(xX 


where Ua(x) = i[nCx) + ?/( — .?;)] is even and u„(x) = ^[^/(a;) — u('-x)] is 
odd. Then if tlie wave equation (9.8) is symmetric!, we can write it as 


dhi, , ,,, 




pj)n, = 0 (9.10) 


On (!hanging the sign of x in Eep (9.10), we obtain 


7 /. ? 


+ {V - E)u,.. + 


hr dhi„ _ 

2nn.'^ ^ 


E)iio = 0 (9.11) 


Addition and subtraction of Eejs. (O.IO) and (9.11) shows that Ua and Uo 
are separately solutions of the wave ecpiation with the same eigenvalue E. 

A Simplified Solution. Knowledge that the solutions possess a defi- 
nite parity sometimes simplifies the determination of the energy levels, 
since then wo need only find the solution for positive x. lOven solutions 
have zero slope and odd solui/ions have zero value at x = 0. If, for 
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example, we wish to find the even solutions, Eqs. (9.2) and (9.3) can be 
replaced at once by 

u{x) = B cos ax, b < X < a 

u{x) “ X > a 

Instead of making both u and dufdx continuous at a; = a, it is enough to 
make the ratio {l/u){du/dx) continuous at a; = a, since the normalizing 
constants B and C are eliminated thereby. This gives Eq. (9,7) at once. 
Similarly, the odd solutions are 

u(x) = A sin ax, 0 < x < a 

u(x) = X > a 

Then continuity of {l/u){du/dx) at a; = a immediately gives Eq. (9.6). 

Problems 

1. Use the arguments of Sec. 6 to set up a differential equation for that involves 
a second time derivative of in the case of a free particle. Discuss any solutions that 
this equation has that are not shared by the free-particle Schrodinger equation. 

2. Show directly that {pa) is real for a wave packet. 

8. Show that for a three-dimensional wave packet 

4. Calculate the energy levels and plot the eigenfunctions for the three bound 
states in the potential of Fig. 7b when Voa* « Compare with the first three 

states for the potential of Fig. 7a. 

6. Discuss the relation between the energy levels for the potential of Fig. 7b and 
those for the potential: V{x) = +«, a; < 0; 7(x) = 0, 0 < a; < a; V{x) = Fd, 
X > a, 

6. Show that if the potential energy V (r) is changed by a constant amount every- 
where, the wave functions are unchanged. What is the effect on the energy 
eigenvalues? 



CHAPTER III 


EIGENFUNCTIONS AND EIGENVALUES 

In Chap. IT, tho Schrodinger wave equation was developed and 
applied to a simple problem. Some discussion of the physical interpreta- 
tion of the wave function was given in Sec. 7. This relied for the most 
part on the computation of average or expectation values of operators 
that represent various physical quantities. In addition, however, a 
position probability density was introduced. Thus at the present point 
we are in a position to calculate from the wave function any property 
that depends on tho spatial distribution of a particle (for example, the 
mean-square deviation of its position from the average), whereas we can 
only calculate average values of operators other than position. Clearly 
what is needed to round out our interpretation of the wave function is a 
means of computing probability functions for operators in general; this 
would incidentally enable us to obtain the expressions for expectation 
values used in Sec. 7 without separate assumptions. 

In this chapter, we first set up three physical postulates from which a 
complete interpretation of the wave function (^an be derived, and then 
apply them to a discussion of the total energy and momentum of a 
particle, and to an illustrative problem. TIk^ interpretation presented in 
Sec. 7 can then be seen to be a special case of the more general treatment 
given here. 

10. INTERPRETATIVE POSTULATES AND ENERGY EIGENFUNCTIONS 

We stai't with the wave function ^(r,/), whi(!h is a solution of Eq. 
(6,16) and ch^scrilx^s tlu^ motion of a parti(^l(^ that has mass m and poten- 
tial energy V(r). From this wave function we wish to ol)tain as complete 
a description of the properties of the motion of the parti<fie iis is consistent 
with tho iin(iertainty relations discussed in 3. 

Dynamical Variables as Operators. We first postulate that each 
dynamical variable iJiat rrlafcs Lo the motion of the parlide rmi be represented 
by an operator, Tho operator may be simply a multiplication operator 
such as r for the position, or it may be a dilTerontial operator such as 
— fA grad for the momentum. 

With each operator can be assocaated an eigem value equation, doifined 

41 
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near the begiiming of Sec. 8, Thus with the operator ^ may be aesoci- 
ated the equation 

where is the eigenfunction of corresponding to the eigenvalue ca. 

Our second postulate is that one or another of the eigenvalues to is the 
oTily possible result of a precise measurement of the dy^iamical variable 
represented by This implies that the eigenvalues of all operatoi’S that 
represent physically significant variables are real numbers. 

Expansion in Eigenfunctions. We assume that all the eigenfunctions 
of any dynamical variable constitute a complete set of functions in the 
sense that an arbitrary continuous function can be expanded in terms of 
them. This is a mathematical, not a physical, assumption, and mil 
be discussed further below in connection with energy and momentum 
eigenfunctions. 

Suppose now that a particular wave function ^ is expanded in terms 
of the eigenfunctions of the operator 12. We adopt the statistical inter- 
pretation of yp given at the beginning of Sec. 7, according to which there 
are a large number of identical, nonoverlapping regions of space, in 
each of which is a particle described by We then make measurements 
of the dynamical variable represented by 12 on each of these particles. 
Our third physical postulate is that the number of measurements that result 
in the eigenvalue o is proportional to the square of the magnitude of the 
coefficient of u^ in the expansion of xp. This enables us to associate a 
probability function with any dynamical variable. A corollary is that 
we are certain to measure a particular eigenvalue co only when the wave 
function that describes the particle is the corresponding eigenfunction Uu- 

Rather than develop the consequences of these postulates for an 
arbitrary operator 12, we consider here the total energy of the particle, 
and in Sec. 11 the momentum. Most of the results obtained are readily 
applicable to other physical operators. 

The Total Energy Operator. According to the uncertainty relation 
(3.3), a precise measurement of the total energy of a particle cannot be 
made in a finite length of time. Thus if the total energy is to have a 
definite value, it is essential that the potential energy F(r) be inde- 
pendent of the time. Then the operator — (A‘-*/2m)V2 + y(r), which is 
shown in Sec. 8 to be equivalent to the total energy operator ih d/dt, has 
eigenfunctions u(r) that need not involve the time. The energy-eigen- 
value equation is Eq. (8.2), 

[- + 7(r)j««(r) 


( 10 . 2 ) 
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where the eigenfunction usij:) corresponds to the eigenvalue and obeys 
the boundary and continuity conditions of Sec. 8. 

As discussed in Sec. 8, the energy eigenfunctions can be divided into 
two classes: those which are well localized and are associated with discrete 
eigenvalues, and those which remain finite at great distances and possess 
a continuous range of eigenvalues. 

Normalization in a Box. It is often desirable to be able to treat these 
two classes on the same basis; this can be accomplished by enclosing the 
particle under investigation in a box of arbitrarily large but finite volume. 
The simplest physical situation to which this approach is applicable is 
one in which the walls of the box are perfectly rigid, so that, as shown in 
Sec. 8, the wave function vanishes there. Then the discussion of Sec. 8 
shows that all the eigenvalues are discrete. If the box is large in com- 
parison with the dimensions of physical interest in the problem, the 
eigenvalues that were discrete in the absence of the box are practically 
unaffected, since before the walls were introduced the wave functions were 
extremely small there. Also, the eigenvalues that were continuously 
distributed in the absence of the box are very closely spaced; this is 
shown explicitly for a free particle in Sec.. 11. 

It is more convenient to assume that the wave functions obey 'periodic 
boundary conditions at the walls of the box than that they vanish there, 
since it is then possible to get a simpler description of the momentum 
eigenfunctions (see Sec. 11). We choose the finite region to be a cube of 
edge length L centered at the oiigin, and require each wave function 
to have the same value at corresponding points of opposite faces of the 
cube, and the same derivative normal to the wall. These boundary 
conditions make the otherwise continuous eigenvalues discrete, since 
the phase of the eigenfunction at great distances is no longer arbitrary 
[see the discussion of ICq. (8.G)]. As with the rigid-walled box, the pres- 
ence of the walls has a negligible eflect apart from imparting discreteness 
to the otherwise continuous eigenvaliK^s and providing a finite volume in 
which these wave func lions can be noi’malized; wo shall continue to use 
the word continuous'^ to describe these functions, even when box 
normalization is used. 

Orthonormality of Energy Eigenfunctions. The integral /|u^(r)|2rfT, 
which converges in any (;ase for th(^ (lis<o'(^l;(^ S(^t of cigcmfiiiKdions, con- 
verges for all eigcmfiinetions wh(^n tlu^y are normalized in the box of 
finite volume L^. The eoefficient of Ub can then be chosen so that this 
integral is equal to unity, and the ?/-ft?(r) are normalized. 

We now show that the cigonfuiK^tions corresponding to two different 
eigenvalues E and E' are orthogonal; that is, that the integral of the 
product of one of them and the complex conjugate of the other over the 
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common domain of the functions is zero. From Eq. (10.2) we have that 
UB»{t) satisfies the equation 

[ - + 7(r)] = ^W(r) (10.3) 

where we assume in accordance with our physical interpretation that E' 
is real; this is verified below. We multiply Eq. (10.2) by ub^ and Eq. 
(10.3) by usy integrate over the volume L®, and take the difference 
between the two resulting equations. The V terms cancel, and leave 

— ^ j {ue»V^Ub — = (jS? — £^') y* u^Usdr (10.4) 

The integral on the left side of Eq. (10.4) can be transformed by 
Greenes theorem into a surface integral over the surface A of the cube. 
J {Ub'V^b — 'iijBV^a')dr 


/ 

h 


div {ua> grad Us ub grad UE')dr 
(US' grad Ue — ub grad UB')ndA 


(10.5) 


where the subscript n designates the component of the vector in the 
direction of the outward normal to the element of area dA. Since the 
imposition of periodic boundary conditions gives each wave function and 
its normal derivative the same values at corresponding points of opposite 
faces of the cube, the outward normal derivative has opposite signs on 
opposite faces, and the surface integral in Eq. (10.6) vanishes. Then 
Eq. (10.4) tells us that if E 5*^ E', ub and ub' are orthogonal.^ 

An energy eigenvalue E is said to be degenerate when two or more lin- 
early independent eigenfunctions Wi, W2, . . . correspond to it. Orthog- 
onal linear combinations of degenerate eigenfunctions can be found in 
many different ways. For example, Ua == aiui + can be made 
orthogonal to 'Wi by choosing the constant coefficients ai and such that 


^ JwiWadr 

02 i\Ui\MT 

this choice does not interfere with the normalization of Ua^ and Ua is still 
an energy eigenfunction with the eigenvalue E. Evidently the choice 
^ It is apparent that this proof of orthogonality can be applied to a discrete set of 
eigenfunctions even though the box is not introduced, since the w^s vanish rapidly at 
great distances and the surface integral, which is then over a sphere of infinite radius, 
is zero. The continuous set of energy eigenfunctions can also be treated without 
using the box normalization (as is the continuous set of momentum eigenfunctions in 
Sec. 11). See E. C. Kemble, **The Fundamental Principles of Quantum Mechanics” 
(McGraw-Hill, New York, 1937) ; Sec. 30 of Kemble's book presents a discussion of 
this problem and references to the original papers. 
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of orthogonal linear combinations is not unique. By an extension of this 
procedure, all the energy eigenfunctions can be made orthogonal to each 
other even though some of the eigenvalues are degenerate. 

Such a set of eigenfunctions, each of which is normalized and orthog- 
onal to each of the others, is called an orthonormal set of functions. We 
specify an orthonormal set of nondegenerate energy eigenfunctions by 
the relation 

piZij^(r)w«(r)r/T = dgjs' (10.0) 

where is the Kronecker 5 symbol that equals unity if E = E' and 

is zero otherwise. If there is degeneracy, Eq. (10,6) must be replaced by 

Ju£'s'(r)‘Wjfs(r)^'^ ~ (10.7) 

where the index 5 distinguishes between orthogonal degenerate eigen- 

functions. It is often convenient to omit Explicit mention of s and use 
Eq. (10.6) for degenerate situations as well, in which case the index s is 
implied. 

Reality of the Energy Eigenvalues. We can now see directly that E 
is a real number, as has been assumed. We multiply Eq. (10.2) by Unir) 
and integrate over the box of volume L^. If Uw is normalized, the result is 

S = — ^ j UKV^Ukdr + j V{x)\uk\Ht 

which maybe expressed in terms of expectation values as (l/2m)(p^) + (F). 

The second term (V) is real since its integrand is real. The first term 
(!an be shown cxplic^iLly to be real by means of a partial integration. 

— j UiNhindT = j (grad Uk) • (grad UK)dr — 'a/^(gl•ad uy)ndA, 

The volume integral is evidently real, and the surface integral [like that 
in Eq. (10.5)] vanishes bccanse of the periodic boundary e.onditions at 
the walls of the box. Ti, is interc'sting i,o note that (p-^) cannot be negative. 

Expansion in Energy Eigenfunctions. As mentioned near the begin- 
ning of this section, wo make the mathcunatie-al assumpti(m that all the 
eigenfunctions Uis{x) of the total energy operator constitute a complete 
set of functions in the sense that an arbit-rary continuous function can be 
expaiid(^d in terms of thcin.^ Then if wo have any wave function T/^(r) 
at a parti(uilar instant of time that is normalized in the box and obeys 
periodic boundary conditions at the walls, the assumed existence of the 
expansion 

^(r) =2 A£!Ue{t) 

E 

^ For further diacusflioii see E. C. Kemble, op. ciL^ Chap. IV and Sec. 30. 


( 10 . 8 ) 
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makes it possible to find unique coefficients Ae that do not depend on r. 

The coefficients in the expansion (10.8) can be determined by multi- 
plying both sides by He' and integrating over the box. We assume that 

the order of ^ and j dr can be reversed^ and obtain 

j UB'(T)i(j)dr = f Us'ir)us(T) dr = S AeSejs' = Ae'. (10.9) 


with the help of Eq. (10.6) or (10.7). 

The Closure Property. Substitution of the expression (10.9) for As 
back into Eq. (10.8) gives 

if-Cr) = ^ [ / j M*(r) 

which we rearrange to give 

^(r) = j i^(r') Wiif(r')wif(r) j dr' (10.10) 

Since ^(r) is an arbitrary continuous function, Eq. (10.10) implies that the 
bracketed part of the integrand vanishes unless r' = r, since otherwise 
the value of ^ at the point r given by Eq. (10.10) would change when the 
values of ^ at other points r' 5*^ r are changed, and this is contrary to 
the assumption that ^ can have an arbitraiy form. Moreover, the 
integral of the term in brackets must be unity when the volume of inte- 
gration includes the point r' = r. We conclude therefore that 


^ ue(t')ub(jc) = 0 , r ' 7 *^ r ; 

( 10 . 11 ) 

j ^ WB(r')tiB(r)dr' = 1 


if the volume of integration includes the point r' == r. 

Equations (10.11) describe the closure property of the orthonormal 
functions Ue{x) and are seen to follow directly from their completeness as 
expressed by Eq. (10.8), whether or not they happen to be energy 
eigenfunctions. 

Probability Function and Expectation Value. The second and third 
physical postulates presented at the beginning of this section state that 
the energy eigenvalues are the only possible results of precise meosure- 

^ The propriety of changing the order of summations and integrations must in 
principle be investigated separately in each case. The mathematical considerations 
entailed are beyond the scope of this book, and we shall always assume that such 
interchanges are permissible in situations of physical interest. 
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ment of the total energy, and that the probability of finding a particular 
value E when the particle is described by the wave function ^(r) is 
proportional to It is easily seen that the proportionality factor is 

unity, for if we put for the energy probability function 

P{E) = |Ab|2 (10.12) 

we see that P(E) sums to unity. 

^ P{E) = 2 y UBir)^|/(T)dT j UK{r*)^{P)dr' 

E E 

^ I f t2fi(r)uj?(r') j drdr' 

= f \HT)[^dT = 1 

since ^ is normalized; use has been made here of Eqs. (10.11). 

We can also compute the average or expectation value of the energy 
from the probability function. 

{E) = Y^ EP{E) f ujc(T')^(r')dT' (10.13) 

E E 

If we substitute fur Eiijs from Eq. (10.3), the first integral in Eq. (10.13) 
can be integrated twice by parts as follows: 

I EMr)Hr)dr = j ^(r) [ - ^ + FCr)] Mr)dr 

= j Mr) [ - ^ + nr)^ Hr)dr 

The two surface integrals that result from the partial integrations vanish 
bc(‘,ause of the periodic boundary conditions on uh and yj/. Thus with the 
help of E(is. (10.1 1), Eq. (10.13) becomes 

(i-) = ^ / Mr) + j u,(T')Hr')dT' 

E 

= f j Hr') j[- + F(r)] ,^(r)| w«(r)M«(r')] drdr' 

E 

= |^(r)[-^V"-+ F(r)]^(r)rfr (10.14) 

The result embodied in Eq. (10.14) confirms the supposition made in 
Sec. 7 that the expectation value of an operator is to be calculated by 
inserting the operator between if (r) and ^(r), so that it operates just on 
the latter, and integrating over r. 
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General Solution of the Schrodinger Equation. If the potential 
energy 7(r) is independent of t and we know the solution of the SchrSd- 
inger equation (6.16) at a particular time, we can write down a formal 
expression for the solution at any time. We expand yp{r^t) in energy 
eigenfunctions at the time in which case the expansion coefficients 
depend on the time. 

^(r,i) = ^ .AB(i)Wii7(r), Asit) = j Uii{r)yp{ryt)dT (10.16) 
Substitution of Eq. (10.15) into the wave equation (G.IG) gives 


ih 



l 


Ae(JI)Euk(j) 


(10.16) 


Because of the orthonormality of the Eq. (10.16) is equivalent to 

= EAsit) 


which may be integrated at once to give 


xEjt — fo) 

Aeit) — AE{tQ)e ^ (10.17) 

Note thatP(E) = \AEit)\^ = |AE(io)|^is constant in time. 

Thus if ^(r,i) is known at the time t = io, the solution at any time t 
is given in terms of Eqs. (10.15) and (10.17): 


or 


Hr,t) = y AMa)e * Usit) ( 10 .] 8 ) 

Asito) = f iU7i(r')}f/(r'ffo)dr' 

Hu) = / [2, A J ^(r',/„)<iT' (10.19) 


The solution (10.18) is a linear combination of the separated solutions 
(8.3) obtained earlier. 


11. MOMENTUM EIGENFUNCTIONS 

The eigenfunctions of the linear momentum operator —ih grad pro- 
vide a second instructive example of the application of the general ideas 
developed at the beginning of the last section. They also are of con- 
siderable usefulness in solving problems of physical interest. 

Form of the Eigenfunctions. The momentum eigenfunctions are 
solutions of the three eigenvalue equations 

-ih grad 'itp(r) = piip(r) 


( 11 . 1 ) 
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or: 


Q Q 

- ^ WpW = P»Mp(r)» -ifi- ^ 'Mp(t) 


They have the form 


- ih-^UTfit) = P.ttp(r) 


^^p(r) = C exp 


p««p(r) 


where (7 is a normalization constant. 

It is convenient, as in See. (1, to change from the momentum vec- 
tor p to the propagation vector k = p/ft, and rewrite the momentum 
eigenfunctions 

U]t{x) = C exp • r (11.2) 

These are eigenfunctions of the momentum operator with the eigenvalues 
ftk. 

Box Normalization. As with the energy eigenfunctions discussed in 
Sec. 10, we can n^stried- the domain of the Wk(r) to an arbitrarily large but 
finite cubical box of volume (umtered at the origin, at the walls of 
which the fuiKdions obey periodic, boundary c.onditions. Then Wk is 
normalized if (7 = L"-. Also, k is no longer an arbitrary real vector; 
its components are restricted to the valiums 


- 27r/^a: j 2irny . 2iraz 

A* = “7- ^ Aj/ = — h't = — 7— 
Jj Jj Jj 


(11.3) 


where Uy^ and n* are positive or negative integers or zero. The hi)uc- 
ing of neighboring k vectors and of their energy eigenvalues ft‘'^k'-*/2?n (;an 
be made as small as d(^sire<l by making L suffic.icntly large. 

It is interesting to note that the momentum eigenfuiKddons (11.2) 
cannot exist within a box that has perfe(^tly rigid walls, since thes(^ eigen- 
functions do not vanish anywhere. This is analogous to the classical 
situation in whic.h the monauitum of a particle that is reflected from a 
rigid wall is not conserved. On th(^ othew hand, the (;ul)ical box with 
periodic, boundary conditions is 0 (iuival(uit to a situation in which the 
entire infinite spacer is divided up into adjac.(mt cubess and all wave func- 
tions are pewiodio throughout space with th(^ pcu’iod Ij along oa(di of the 
three cartesian axes. If the periodicity of tlu^ spiw^e is carried over to 
the analogous classical situation, a particle passing through a wall would 
be equivalent to one that strikers that wall and appeal’s at the correspond- 
ing point of the opposite wall with its momentum vector unchanged. 

The orthonormality of the momentum eigenfunctions 


W'k(r) = OX]) 7k • r 


(11.4) 
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is readily established. For integration over the volume L® 

1 f\L r\L fhL 

J-^L J -JL 

= ^kxlJkvlv^hMl, = 5ki (11-6) 

where the d^s are Kronecker 5 symbols and use is made of Eqs. (11.3). 
Orthogonality could also have been shown by the more general method 
used in Sec. 10 for the energy eigenfunctions [see Eq. (10.4)]. 

The Dirac 5 Function. It was stated in Sec. 10 that continuous sets of 
eigenfunctions can be handled without introducing the box with periodic 
boundary conditions (which has the effect of making the set discrete with 
an arbitrarily small spacing of eigenvalues) . This can be shown explicitly 
for the momentum eigenfunctions with the help of the Dirac 5 function/ 
which can be defined by the relations 

5(x) =0 if X9^0, fS(x) dx^ I (11.6) 

where the region of integration includes the point 3 ; = 0. An equivalent 
definition is that for an arbitrary function f{x) that is continuous at 
a? = 0, the equation 

Sfix)d(x)dx = /(O) (11.7) 

is valid, where again the integration includes the point a; = 0. 

It is apparent from a comparison of Eqs. (11.6) and (10.11), or of 
Eqs. (11.7) and (10.10), that the bracketed quantity in Eq. (10.10) can 
be expressed in terms of 5 functions : 

UB{r^)us{r) = d{x — x^)d{y — y*)h{z - 2 ') s 5(r — r') (11.8) 

Comparison of Eqs. (11.8) and (10.6) shows that the closure properiy 
is a kind of orthonormality of the eigenfunctions with respect to summa- 
tion over the eigenvalues. 

A Representation of the 5 Function. The definition (11.6) or (11.7) 
shows that 5(a:) is an exceedingly singular function. It may be thought 
of qualitatively as being zero everywhere except at rr = 0, and being so 
large there that the area between it and the x axis is finite and equal to 
unity. More formally, it can be represented in a number of different 
ways as the limit of a sequence of analytic functions. 

A particular representation that is quite useful involves (sin gx)lTX 
as a function of x^ where p is a positive real number. This has the value 
g/ir at a: =» 0, oscillates with decreasing amplitude and with period 2ir/f/ 
as \x\ increases, and has unit integral from x==— to x = + 00 indepen- 

1 P. A. M. Dirac, “The Principles of Quantum Mechanics,” 2d ed., Secs. 20, 21 
(Oxford, New York, 1936). 
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dently of the value of g. Thus the limit of this function os g > oo has all 
the properties of the d function: it becomes infinitely large at a? = 0, 
it has unit integral, and the infinitely rapid oscillations as \x\ increases 
mean that the entire contribution to an integral containing this function 
comes from the neighborhood of x = 0. We can therefore put 


d{x) = lim 

f7— ► 00 


sin gx 
ttx 


(13.9) 


Normalization in Terms of the d Function. The representation (11.9) 
of the 5 function can be used to set up an orthonormality integral like that 
given in Eq. (11.5), where now we do not impose the box normalization 
but allow the momentum eigenfunctions to have the form (11.2) over all 
space with all real vectors k. The integral /wi(r)?/k(r)n?T is the product of 
three integrals, each of which can be expressed in terms of a 5 function: 



Qi(.k;n—lx)x ^ 

D — > OO J ~‘0 


= lim 

g-^ 00 


2 sin jf 7 (A’^ — Zx) 


A X ^aJ 

= 2t5(A*x — 


( 11 . 10 ) 


Thus the momentum eigenfunctions in infinite space can be writtc^n 


Wic(r) = (87r®)~i exp • r (11.11) 

in which cfise the orthonormality relation bec-omes 

/77i(r)wk(r)r/r = 5(/rx - DKK - luW\ - Q ^ 5(k - 1) (11.12) 

It will l)(^ shown in Sec. 12 that Uie box and 5-fun<d.ion normalizations 
of the momentum eigenfunctions give the same final rc^sult in a typicail 
problem. 

Some Properties of the 8 Function. It is important to note that, 
because of its singular <^hara(*,t(u*, tlu* 5 fumduon cannot ho the end result 
of a calculation, and has moaning only so long as a, subsociuont integration 
over its argument is carried out. With tliis understanding wc can wi'ito 
down some relations betwecui 8 functions.^ 


«(;r) = S{-x) 

S'{x) = -S'i-x) 
x8(x) = 0 

a:5'(:r) = -5(:r) (1 I.IH) 

8{ax) = a'~^8(x)j a > 0 

8{x^ — a^) = 2a~^[8{x — a) + 8(x + a)], a > 0 

J5(a — x)8{x — b)dx = S(a — b) 

f{x)8(x — a) = f(,a)8(x — a) 

Here, a prime denotes dilTerentiation with respect to the argument. 

^ P. A. M. Dirac, op. cit.j p. 76. 
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Each of the first six of these equations can be established by multiply- 
ing both sides by a continuous, differentiable function /(:c) and integrating 
over a;. For example, the fourth of Eqs. (11.13) gives 

j f{x)xd'(x)dx = y ^ W{x)]dx 

= - J’5(a;)[/{ic) + xf'{x)]dx = - iKx)d(x)dx 

where the boundary terms that result from the partial integration vanish. 
Thus xS'{x) has the same effect when it is a factor in an integrand as has 
— 5(x). Similarly the seventh of Eqs. (11.13) means that the two sides 
give the same result when multiplied by /(a) or f{b) and integrated over a 
or b. The last equation is verified by integrating both sides over either 
X or a. 

Closure. The closure property of the momentum eigenfunctions, 
with both box and d function normalization, can be established without 
the help of the completeness assumption that was made in Sec. 10 for 
the energy eigenfunctions. With box normalization, the expression 
analogous to the left side of Eq. (11.8) is 

OQ oo ao 

^ wk(r')wk(r) = L-* X I! S 

k CO ni“— 00 

This is readily evaluated in the limit of largo L, in whic.h case the sum- 
mand changes by a negligible amount as each n changes by one unit. 


We can then regard n* as a continuous variable and replace ^ by 

/•ao r 03 

f ~ (^/27r) f ^ dkx. We thus obtain 


^ Mk(r')«k(r) £-;^ (87r»)-i 

= S(x - x')d(3j - y')S{z - z’) = 3(r - r') (11.14) 

on making use of Eq. (11.10). 

A similar calculation can be carried through, using the 5-function 
normalization, in which case we obtain from Eqs. (11.11) and (11.10) 

iM^>kir)drjc = JJJuk(^')^ik(r) dKdkydK - 6(r ^ r') (11.15) 


The closure relation (11.14) or (11.15) shows that the momentum 
eigenfunctions are orthonormal with respect to summation or integration 
over the eigenvalue k as well as with respect to integration over the posi- 
tion vector r. 
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Expansion in Momentum Eigenfunctions. An ai-bitrary continuous 
function \^(r) can be written in terms of the 5 function as 

Ki) = ;if'(r')S(r - t')dr' (11.16) 

If we substitute the left side of Eq. (11.14) in place of 5(r — r') in Eq. 
(11.16), we obtain 

l^'(r) = f i'ir') y Uk(t')ut(r)dT' = V Ak?/t(r) 

^ V r ^11.17) 

= /wk(r')\A(rOrfT' 

Similarly, if we substitute for 5(r — r') from Eq. (11.15), we obtain 
lA(r) = fi'{t')jilk{t')uicir)drdT' = /AkMk(r)dT6 (11.18) 

with the same expression for A^. Equations (11.17) and (11.18) show 
that it is possible to expand an arbitrary function in momentum eigen- 
functions that are normalized either in a box or by means of 5 functions.' 

Probability Function and Expectation Value. The momentum 

probability function associated with a normalized wave function vf'(r) is 
proportional to jAkl^. The proportionality factor is unity, since if we 
put 

P(k) = (11.19) 

it is easily shown in analogy with the summation of Eq. (10.12) that 


2p(k) = l and jP(k)dTk = l 


( 11 . 20 ) 


for the box and 5-fiinotion normalization, respectively. 

The expec^tatioii value of the momentum when box normalization is 
used is 

(p) = ft^kP(k) == y kuk(r)iA(r)t/r j (11.21) 

k k 

From the complex conjugate of Eq. (11.1), we can replace kwk(r) by 
i grad 'Wk(x’)- ''rhen the first integral in Eq. (1 1.21) can be integrated by 
parts and the surface integral vanishes because of the periodic boundary 
conditions on \f/ and Uk> Thus with the help of Eq. (11.14), Eq. (11.21) 
becomes 

(p> = -ih y j Uk{r) grad i{r)dr j nit{r')^(r')dT' 

= ~ih /J'.f(r')[grad i^(r)]5(r - t^hdr' . -o') 

■ = -iAjif(r) grad i/'(r)dT ^ ‘ ^ 

1 These results, while not rigorously established hero, arc equivalent to the mathe- 
matical theorems on the expansibility of functions in Fourier series and Fourier 
integrals. 
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This is in agreement with the second of Eqs. (7.8). 

When 5-function normalization is used, the details of the calculation 
are very similar to those given above, except that the surface integral 
that results from the partial integration is over a sphere of infinite radius; 
it is zero because yp becomes vanishingly small at great distances. This is 
consistent with the supposition that ^ is normalized; otherwise neither 
/P(k)drfc nor (p) have any physical meaning. The result of the calculs^- 
tion in this case is the same as Eqs. (11.22) and (7.8). 

12. MOTION OF A FREE WAVE PACKET IN ONE DIMENSION 
The motion of a free particle (no external forces) in one dimension is 
described by the Schrodinger wave equation (6.8). The study of this 
motion provides an interesting application of the expansion techniques 
developed in Secs. 10 and 11. As a first step, we find the minimum value 
of the uncertainty product given in Eq. (3.1), and the possible forms of 
the one-dimensional wave packet that correspond to it, all at a definite 
instant of time. The structure of this minimum packet is the same 
whether or not the particle is free, since this form can be regarded simply 
as an initial condition on the solution of the Schrodinger equation for 
any V. However, the analytical work involved in finding \f/ at other 
times is especially simple in the force-free case. 

The Minimum Uncertainty Product.^ In order to find the minimum 
value for the uncertainty produce Ax • Ap, we must first define Avhat we 
mean by Ax and Ap, While many expressions are possible, the simplest 
to handle analytically is the root-mean-square deviation from the mean, 
where the word ^^mean’’ implies the expectation value of Sec. 7. 

(Aa;)* = {{x - {x}y} = <ar*> - <2a;(a:)> + {(xy) = {x^} - (x>* i\ 

(Ap)* = ((p - (p»*) = <p*> - (p)* ^ ■ 

Here the equalities follow directly from the general definition of expecta- 
tion value given in Sec. 7. If now we put 

a ^ X - <x), <p) = -ih [4 - (12.2) 

then 

(Axy(Ap)^ = j j " 

= ^{^){oi^)dx ^ Q\p){Pil/)dx (12.3) 

The transformation of the a integral in Eq. (12.3) is obvious; the similar 
transformation of the /3 integral follows from a partial integration when 
^W. Heisenberg, “The Physical Principles of the Quantum Theory, “ pp. 17-19 
(University of Chicago Press, Chicago, 1930). 
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we remember that ^ is a normalized wave packet, which vanishes at 


X = ± oo . 

The inequality 



fffffix 

JW^ 


dx ^ 0 


where all integrals are from x = — oo to + qo , is obviously true, and the 
equality is applicable only if / = yg, where 7 is a constant. From this 
inequality we obtain at once 

If now we replace /by cop and (j by Eq. (12.3) becomes 

(Aa;)2(Ap)2 ^ \S{^W^P)dx[^ = \S^oifiypdx\^ (12.4) 


The last term in Eq. (12.4) can be written 
- ISa) + i(a(i + I3a)]\pdx\- 

= — I3(x)\pdx\‘^ + + I3a)}pdx[^ (12.5) 

The cross t(^rm in tlie prodiKit that is omittc^d on th(‘ right side of Eq. 
(12.5) can Ix^ secui to vanish wlum use is made of th(^ reflation 

= j-pad'i' <fx == J (^^)(a^) r/.r = j^l^axp dx, 

which is ohtaiiHMl by using piii*(.ial inl,(*gi‘ation and nunemlxu’ing that a 
is real. Now from Mep (12.2) 


{ad — da)-p = ^ih 


1 

il 

(12.()) 

AVo thus obtain iVorn E(is. (12. 1), 

(i2.r,), 

and (l2-()) 


(AxyiApy ^ w 

or 

A.r ■ Ap ^ 

(12.7) 


wheni (h(^ (X|iialil.y (\*in hold only if llu' se(',ond t('rin on ilu^ right side of 
Eq. (12.5) is /(‘ro. 4’his is IIk^ pr(x*ise (^xi)i’(\ssic)n of tlu^ Ibnscniborg 
uncertainly ndation (3.1), wIh'ti tlu^ unc(n'tainl.i(*s A,r and Ap are defined 
as in 10(1. (12.1). 

Form of the Minimum Packet. It follows from tln^ fon^going deriva- 
tion that tb(^ minimum unc(M‘tninty prodind, is attained only when two 
conditions are fulfilled. 


axp = yp\p 
+ Pa)‘pdx = 0 


( 12 . 8 ) 

(12.9) 
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Equations (12.8) and (12.2) give us a differential equation for ^ 

which is readily integrated to give 

l/'(a;) = N exp {x - (a:))® + (12.10) 

where JV is an arbitrary constant. 

Equation (12.9), with the help of Eq. (12.8), becomes 

(7 ^ 

which evidently requires that 7 be pure imaginary. Then since we want 
Eq. (12.10) to represent a wave packet for which the integral of con- 
verges, 7 must be negative imaginary. The magnitude of the constant N 
can now be fixed by normalizing \p. 


M^dx = 1 

Similarly, 7 can be determined by requiring that 

■ J(:r — {x)y\^p\‘^dx = 

The integrals are readily evaluated, and lead to the normalized minimum 
wave packet 

^Pix) = [27r(Axy]-i exp [ - (12.11) 

Momentum Expansion Coefficients. The one-dimensional momen- 
tum eigenfunctions analogous to Eqs. (11.4) and (11.11) are 


Uk(x) = ( 12 . 12 ) 

for normalization in a one-dimensional “box^^ of length L, and 

y'k{x) = (27r)-le"’® (12.13) 


for fi-function normalization. Since for a free particle the wave equation 
has the simple form of Eq. (6.8) 


.1 __ __ 


(12.14) 


the momentum eigenfunctions are also eigenfunctions of the energy.^ 

^ The converse is not necessarily true, since there are two solutions of Eq. ( 12 . 16 ) 
for k (positive and negative) for each value of Ek. 
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Thus any solution of the wave equation can be written in a form analogous 
to Eq. (10.18), 

or j dk^ Ake * Uk{x)j (12.15) 


where the Ah are independent of x and t] the entire time dependence is 
contained in the exponential factor. Equation (12.15) is readily verified 
to be a solution of Eq. (12.14) by direct substitution, provided that 




2m 


(12.16) 


The problem of finding the motion of a wave packet is thus resolved 
into finding the expansion coefficient A* at some particular time, say 
t = 0, and then using Eqs. (12.15) and (12.16) to find TpiXyt) at other 
times. At i = 0, the exponential factors in Eq (12.15) are unity, and 
we may use the one-dimensional analogue of the second of Eqs. (11.17) 
to find Ah. 

Ah = Juh{x)\l/(xj0)dx (12.17) 

The limits on the integral are x = ±^L or x = ± «, according as box 
or 5-function normalization is used. The momentum probability func- 

iEkt 

tionP(/r) = \Ake ^ |2 = |Aa;|^ is independent of the time, so that (p) and 
Ap, for example, are constants. 

Change with Time of a Minimum Packet. As a simple specific exam- 
ple, we take i/{XyO) to have the form of Eq. (12.11) with (x) = (p) = 0, 
so that the wave packet initially is centered at = 0 and has zero average 
momentum. Then using box normalization, Eq. (12.17) gives 

At: = [2irZ/’(A.'t)”]-l oxp 1^- 

-[ 

where L is assumed to be so large that the contribution to the integral 
from la:| > can be neglected. Substitution into (12.15) gives the 
wave function for general values of / 

UfcCz) (12,19) 


J 


(12.18) 


where k = %rn/L and n takes on all positive and negative integer values 
and zero. As in Sec. 11, L may be taken arbitrarily large, n can be 
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regarded as a continuous variable and the summation replaced by Jdw, 
which in turn is the same as {L/%r)^dk. Thus 


- [- 4(AX). + (2.7.^) ] 

The position probability density is then 





(Aa;)^ + 




• exp 


( \ 


(^)2 \ 1 } 

^ Am^iAxyW 


( 12 . 21 ) 


Equation (12.21) is of the same form as l^(a;,0)l^, except that (Ax)^ 
is replaced by (Ax)* + ft*iV4m*(Ax)*, which is equal to (Ax)* + 

Thus the center of the packet remains at x = 0 while the breadth of the 
packet increases as t departs from zero in both past and future directions. 
The smaller the initial uncertainty in position; the larger tlm uncertainty 
in momentum and the more rapidly the packet spreads; the time-depend- 
ent part of the above expression, t(Ap)/m, is simply the distance traveled 
by a classical particle of momentum Ap in the time t. 

Use of the 5-function normalization does not alter the results of the 
foregoing calculation. The expression for Ak given in Eq. (12.18) is 
to be multiplied by (L/27r)^;inEq. (12.19) the summation is to be replaced 
directly by Jd/r, thus eliminating a factor L/27r; finally, Uk in Eq. (12.19) 
is to be multiplied by {L/2t)K These three factors cancel, and so Eqs. 
(12,20) and (12.21) are unaffected by the choice of normalization of the 
momentum eigenfunctions. 

Classical Limit. We have seen in Sec. 7 that a wave packet always 
moves like a classical particle in so far as the expectation values of its 
position and momentum are concerned. However, classical dynamics is 
only useful as a description of the motion if the spreading of the wave 
packet can be neglected over times of interest in the particular problem. 

As a simple example of the kind of parameter that indicates when the 
classical limit is realized, we consider a wave packet that corresponds to a 
classical particle moving in a circular orbit of radius a and period T. We 
shall assume that this packet is sufficiently well localized so that the 
potential energy does not vary appreciably over its dimensions. Then 
the classical theory can provide a useful description of the motion only 
if a wave packet like that discussed above spreads by an amount that is 
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small in comparison with a during a time that is large in comparison 
Avith T, The smallest spread of a packet during a time interval of mag- 
nitude t is attained when Ax is chosen to be of order {ht/in)K We thus 
require that {ht/m)\ <^a when T, This condition may be simply 
expressed by saying that the angular momentum ^mraa^jT of the particle 
must be very large in comparison with Hi. Thus for most atomic systems, 
where the angular momentum of each electron is of order ft, a wave packet 
corresponding to a well-localized particle spreads so much in one period 
that this type of description of the motion is not of physical interest. 


Problems 

1 . Given three degenerate eigenfunctions that are linearly independent although 
not necessarily orthogonal. Find three linear combinations of them that are orthog- 
onal to each other and arc normalized. Are the three new combinations eigen- 
functions? If so, ore they degenerate? 

2. Show that so far as the one-dimensional motion of a particle is concerned, the 
functions Ux'{x) — b{;x — x') for all real x' constitute a complete orthonormal set, 
and that each of them is an eigenfunction of the position variable x with the eigen- 
value x'. Set up the position probability function and compare with that obtained 
in Sec. 7. 

3. If the potential energy V{x) in a one-diineiiHional problem ia a moiiotonio 

increasing function of x and independent of the time, show that tho functions 
Uv*{x) =» (dV'/f/.r) 5{x — x') for all real x', where V' = V(x')j constitute a com- 

plete orthononnal set of eig(mfun(!tionH of tlic potential (mergy with eigenvalues V'. 
Find the probability function for the potential energy, ajid show that it has tho prop- 
erties that would be oxpectcMl of it. 

4. What changers are needed in the discussion of the momentum eigenfimetions 
given in See. 11 if normalization is (tarried through in a box of rectangular parallel- 
epiped shapes ratluT than in a box of cubic, al shape? 

6. Find two othc,r n^pn^sentatioiis for the Dirac S function like that given in Eep 
(ll.O), 

6. V(U’ify each of lOqa. (11 .13) involving 5 functions. 

7. Show that the two hkia. (11.20) are correct: that tho inoinentiim probability 
function delincd in I'kis. (11.10) and (11.17) for a normalizc^d ^ sums or integrates to 
unity. 

8. Tho expression in scpiare brackcU in th(^ integrand of lOq. (10. 10) enables one to 

calculate ^ at tiim* L in terms of \f/ at Lime U. If this (^xpn^ssion is called Jo) in 

the one-diimmsiomil case, then Sliow tin t for a free 

particle in one dimension 


Af/ \ [iin(x — x')-l 

a{x,x,t,u) = - <„)J L 2ft(t - <„) J 


Assume that lias tlu^ form of Llu^ normalizcrd minimum wave packet (12.11) at = 0; 
use thc^ above result to find ^ and at another time L 
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DISCRETE EIGENVALUES : ENERGY LEVELS 

The formalism that was developed in Chap. II and elaborated in Chap. 
Ill will now be applied to the explicit computation of discrete energy 
levels and the corresponding eigenfunctions. The next chapter will 
take up situations in which the energy eigenvalues are continuously 
distributed. Thus we are concerned here with bound states in which the 
particle is restrained by the external forces (potential energy) to a 
particular region of apace, and in the next chapter with collision problems 
in which the particle can approach from and recede to infinite distance. 

The relatively few potential energy functions F(r) for which analytic 
solutions of the wave equation (8.2) are possible, are important beyond 
these immediate problems, since they often serve as bases for approxi- 
mate calculations on more comphcated systems. 

18. LINEAR HARMONIC OSCILLATOR 

The one-dimensional motion of a point mass attracted to a fixed 
center by a force that is proportional to the displacement from that center, 
provides one of the fundamental problems of classical dynamics. Its 
study is important not only for itself, but also because more complicated 
systems can often be analyzed in terms of normal modes of motion that 
are formally equivalent to harmonic oscillators. The linear harmonic 
oscillator in quantum mechanics is similarly of importance both for the 
study of such problems as the vibrations of individual atoms in a mole- 
cule, and for the analysis of more complicated systems such as crystals 
and (as we shall see in Chap. XIII) quantized wave fields. 

Asymptotic Behavior. The force F = —Kx can be represented by 
the potential energy V{x) = so that Eq, (8.6) becomes 

It is convenient in dealing with an equation of this type to rewrite it in 
dimensionless form. To this end we introduce a dimensionless inde- 
pendent variable ^ — ax, and a dimensionless eigenvalue X, and attempt 
to put Eq. (13.1) in the form 
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Comparison of Eqs. (13.1) and (13.2) shows that this is possible if 



where wo = {K/m)^ is the angular frequency of the corresponding classical 
harmonic oscillator. 

The solution of Eq. (13.2) is facilitated by first examining the domi- 
nant behavior of u in the asymptotic region f ± oo .i For sufficiently 
large f it is apparent that w(f) = satisfies Eq. (13.2) so far as the 

leading terms (which are of order are concerned, when n has any 
finite value. The boundary conditions of Sec. 8 permit us to keep only 
the minus sign in the exponent. This suggests that it might be possible to 
find an exact solution of Eq. (13.2) of the form 

uU) = //(f)(3-i«- (13.4) 

where is a polynomial of finite order in Substitution of Eq. 
(13.4) into Eq. (13.2) gives as the equation for H{^) 

Zf" - 2iH' + (X - 1)H = 0 (13.6) 

where primes denote differentiation with respect to f . 

Energy Levels. We now find a solution for H in the form 

H{^) = f*(ao + aif + +•••), ^ 0, « ^ 0 (13.6) 

This is necessarily finite for f = 0. Equation (13,5) is to be valid for all 
values of so that when Eq. (13.6) is substituted into it, the coefficient 
of each power of f can be equated to zero. 

s{s “ ])ao = 0 
(s + l)sai = 0 

(5 + 2) (s+ l)a 2 - (2s+l - X)ao = 0 (13.7) 

(5 “}- 3)(s “1“ 2)(x^ — (25 H“ 3 — X)fli = 0 


(5 “h z' “|- 2)(s “h V V)(ipj ^2 — (2s “b 2z' “hi — X)a,» = 0 

where v is an integer. Since ao cannot be zero, the first of Eqs. (13.7) 
tells us that s = 0 or s = 1. The second equation tells us that s = 0, 
or tti = 0, or both. Then the third equation gives us in terms of flo, 
the fourth gives us as in terms of ai, and the general equation gives us 
ap +2 in terms of ay. 

It follows from Eqs. (13.7) that the presence in the series (13.6) of a 
finite or an infinite number of terms depends on the choice of s, ai, and 
^ We follow the polynomial method of A. Sommerfeld, “Wave Mechanics,” p. 11 
(Dutton, New York, 1929). 
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the eigenvalue X. If the series does not terminate, its dominant asymp- 
totic behavior can be inferred from the coefficients of its high terms 

Qy+2 ^ 2 

J/-»> eo 

This ratio is the same as that of the series for with any finite value of 
n. Equation (13.4) shows that this behavior for H violates the boundaiy 
conditions on u for large jf]. 

Thus the series (13.6) must terminate. This means that 
X = 25 “h 2v + Ij 

V must be an even integer, since ao 0 and otherwise the even-subscript 
terms would form an infinite series. Since the odd-subscript series can- 
not then terminate, we must choose Ui = 0. The index s can still be 
either 0 or 1, and corresponding to these two values X is equal to 2 j^ + 1 
or 2v + 3, where v is an even integer. We may express both cases in 
terms of a quantum number n 

X = 271» -f- 1, En = (^ ■2')^Wc, = 0 , 1, 2, . . . (13.8) 

Zero-point Energy. The infinite sequence of energy levels (13.8) has 
the equal spacing postulated in 1900 by Planck, which is in agreement 
with the quantization rules of the old quantum theory. However, the 
finite value of the ground-state energy level iAcjc, which is called the 
zero^oint energy, is characteristic of the quantum mechanics, and is 
related to the uncertainty principle in the same manner as is the finite 
lowest energy level for the square well with perfectly rigid walls (Sec. 9). 
The total energy is of order (Ap)^/m + where Ap and Aa; are 

measures of the spreads in momentum and position, as defined in Sec. 12; 
if this is minimized, taking account of the uncertainty relation (3.1), 
it is easily seen that the minimum Ap is of order so that the 

minimum total energy is of order h{K/in)^ or 

Parity. It follows from Eqs. (13.8) and (13.7) that n is the highest 
value of s -|- V in the series (13.6) for H. If we denote the corn^sponcling 
polynomial by .ffn(f), we see that f/n is of degree n in and is wholly 
even or odd according as n is even or odd. Since is oven and has no 
nodes, the corresponding eigenfunction i4„($) has the parity of n, and has 
n nodes. These conclusions are in agi’eement with the earlier n^sults of 
Secs. 8 and 9. 

Harmite Polynomials. The polynomial of order n that has the parity 
of n and is a solution of Eq. (13.5) with X = 27i + 1 

H': - + 2nHn = 0, 


(13.9) 
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is called the nth Hennite polynomial It is clear from the fore- 

going solution of Eq. (13.5) that these conditions define Hn uniquely 
except for an arbitrary multiplying constant. It is not necessary, then, 
to use the recursion relations (13.7) to study the detailed properties of 
the Hm if some other formulation of them can be found that is consistent 
with these conditions. A far more convenient formulation is actually 
available, which expresses the Hn in terms of a generating function 8(^,8). 

(13,10) 

If the exponential in P]q. (13.10) is expanded out in powers of s and it is 
seen that a given power of s is associated only with powers of f that are 
equal to that power or less than it by an even integer. Thus Hn{^) 
defined in this way is a polynomial of order n that has the parity of n. 
To show that this Hn satisfies the dilTcrential eciuation (13.9), we 
differentiate both sides of Eq. (13. 10) first with respect to £ and then with 
respect to s. 







.. ^ 


(-2.s' + 2^).s» 


n] 


(S) 





ti 


JO(iaating eiiual powers of .s* in sums of fh<‘S(*. two ecpuitions gives 
respectively 


= 2nHn-i 

//nil = 2^11 n — 2nlfn-,l 


(13.11) 


The low(^st order diffennitial (^illation involving only Hn that can be 
constriK^ted from Efis. (13. 1 1) is easily S(M‘n to Ix^ Kq. (13.9). Thus the 
HnU) giv(ni by lOcp (13.10) are th(^ Ibn-mil.e polynomials. 

The rc'lations (13,1 I) may be used for Llu^ eahnilatioii of the //„ and 
their deriv^ativc^s, or n.n (^xplicnt (^xi)r(’ssion old.ainable dinxitly from the 
geiKn-ating fiiiudJou may Ix^ us(xl. If dilhn’cmtiated n times with 

rospecd. to .s* and .s* is then set (xjual to 0, Va\. ( 13. 10) shows that the result 
is simply Hn{k)- Now for any fumd ion of Mx^ r()rm/(.s* — i) it is apparent 
that 

d.s- 


d/ 

df 
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Thus 

— = ef — = (- l)“cP — 

38" 3«« K >-i ^ e 

This gives an expression for the nth Hermite polynomial 


(13.12) 


The first three polynomials calculated from Eq. (13.12) are 

H,{^) = 1, = 2e, H,ii) = - 2 

Harmonic-oscillator Wave F\mctions. The generating function is 
also useful for the calculation of integrals involving the harmonic-oscil- 
lator wave functions (13.4) 

u„(a;) = iV„H«(ae)e-i“’»‘ (13.13) 

Suppose, for example, that we wish to normalize «»(») ; this is equivalent 
to choosing the constant Nn such that 

f” Ma:)|“da; = 1^' [“ H»(f)e-«y{ = 1 

^ — 00 CL J ^ aa 

The integral on the right can be expressed as a series coefficient in the 
expansion of an integral containing the product of two generating 
functions. 


- X D S (13.14) 

»"»0 m “0 

The integral on the left of Eq. (13.14) is readily evaluated directly to give 





(13.15) 


Tf equal powers of s and t are equated in the series on the right sides of Eqs. 
(13.14) and (13.15), we obtain the results 


= Ti2’*n!, 


(13.16) 


The first of Eqs. (13.16) tells us that the normalizing constant can be 
chosen to be 



•Sec. 13] 


DISCRETE EIGENVALUES: ENERGY LEVELS 


65 


where a constant multiplicative complex phase factor of unit magnitude 
is still arbitrary. The second of these equations tells us that u^{x) and 
Um{x) are orthogonal to each other lin m\ this is in agreement with the 
general result obtained in Sec. 10 for nondegenerate energy eigenfunc- 
tions, since in accordance with Eq, (13.8), En Em. H n ^ m, and so 
there is no degeneracy. 

The integral 

[ u„{x)xUm{x)dx = f 

J — 00 <X J ^ 00 

is typical of others that can be evaluated with the help of the generating 
function. The two series expressions for the integral, 




irKs + 


00 


n! 


may be equated term by term. With the help of Eq. (13,17), we get 


/ 1 ( !L+J:\ 

r « \ ^ \ 2 / 

/ Unix)xUmicc)dx «= < 1 / nV 
“ / « \ 2 / 


m, = n + ^ 


) m 


= 71-1 ( 13 . 18 ) 


otherwise 


Correspondence with Classical Theory. Plots of the first six har- 
monic oscillator wave functions arc shown in Fig. 10. It is apparent 
that the position probability densities |wn|^ associated with these station- 
ary wave furuitions have little resemblance to the corresponding densities 
for the classical harmonic oscillator; the latter are proportional to 
(So ~ where £o is the amplitude of the classical oscillator whose 

energy is equal to the quantum-mechanical eigenvalue. The agree- 
ment between classical and quantum probability densities improves 
rapidly with increasing n. Figure 11 contains a plot of \un\^ for n = 10 
(solid curve), and of the density of a classical oscillator of i.otal energy 
(dashed curve). The agreement is quite good on the average, the 
principal discrepancy being the rapid oscillations in jwnl^. 




-5-4-3'2^1 0 I 2 J 4 5 -5-4 -3 -2 -1 0 12 3 4 5 -5-4-3 -2 -I 0 I 2 3 4 5 

i ’ (, (. 

Fig. 10. Energy eigenfunctions for the first six stivtes of the htinnonic oscullator. {After 
L. Pauling andE. B. Wilson, Jr., "Introduction to Quantum Mechanics," pp. 74 75 (AfcGraiv- 
mi, New York, 1936).] 



Eeg. 11. Position probability density for the state n = 10 of a harmonic oscillator (solid 
curve) and for a classical oscillator of the same total energy (dashed curve), {After L. 
Pa/uLing arid E. B. Wilson, Jr., op. cU., p. 76.) 
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The expectation value for the potential energy can be obtained from 
Eq. (7.6): 

( V)n = Un{x)iKx^Un(x)dx 

= ^ ^)h(t3a = iEn 

where fx^\un\^dx can be calculated with the help of the generating func- 
tion in analogy with the evaluation of Eq. (13.18). Thus for any value 
of Uj the average potential and kinetic energies are each half of the total 
energy, just as is the case with the classical oscillator. 

In similar fashion it can be shown that (a;) = {p) = 0 for any 
harmonic-oscillator wave function, so that Eq. (12.1) tells us that 
{^xY = (x^), and (Ap)^ = It is then easy to see that the uncer- 

tainty product is 

Ax ■ Ap = (n + i)h 

This has the minimum possible value of Eq. (12.7) for the ground- 
state eigenfunction 

«o(a:) = (13.19) 

which, as would be expected, is of the form of the minimum packet 
(12.11). Thus the minimum packet happens to be an eigenfunction of 
the harmonic-oscillator wave equation if its Ax is properly related to K 
and m. 

Oscillating Wave Packet. In accordance with Eq. (10.18), the 
general solution of the time-dependent Schrodinger equation for the 
harmonic oscillator 

ih ^ Hx,l) 

can be expanded in terms of stationary wave fun(*,tions 

00 {Pint 

^ ^ (13.20) 

71 = 0 « = 0 

where the A,, arc arbitrary constants. Thus apart from the phase 
factor ^(:c,i) is a periodic function of f. with the period of the classical 

oscillator 27 r/cjo. This suggests that, it might be possible t,o find a solution 
in the form of a wave packet whose center of gravity oscillates with the 
period of the classical mot ion. 

To investigate this possibility, we assume that at =‘ 0 the ^ of 
Eq. (13.20) has the form of the normalized minimum packet (13.19), 
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except that the center of gravity is displaced in the positive x direction 
by an amount a. 

00 

l^(a;,0) = V A^u^ix) = ^ (13.21) 

n »0 

We can make use of the orthonormality of the Un to calculate a particular 
coeflS.cient Am by multiplying Eq. (13.21) through by Um{x) and integrat- 
ing over X, 

j“^u„ix)^ix,0)clx = 

The integral on the right can be evaluated with the help of the generating 
function by equating term by term the two series expressions for the 
integral 












00 

X 


n -0 


(s|oV*. 

n\ 


On making use of Eq. (13.17), we obtain 


(2»n!)i 

Substitution of these An into Eq. (13.20) gives 


(13.22) 


X (ifoe-‘“-‘)” 

n — 0 

= exp (- ie - -i?? - iwct - 
= ^ exp [- i({ — lo cos Wei)* 

TT* 

— i(icjot + ffo sin cjct — ifo slii 2coci)] 

where the sum is evaluated with the help of the generating function 
(13.10). The absolute square of this wave function gives a position 
probability density 


TT* 
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This shows that ^ represents a wave packet that oscillates without 
change of shape about x = Oj with amplitude a and the classical frequency. 

As a 0, ^ approaches the lowest energy eigenfunction Uq(x)^ 

The larger a becomes, the larger the number of stationary states that 
contribute significantly to the packet, and the larger the quantum num- 
ber no for which An of Eq. (13.22) has a maximum. For n >$> 1, we can 
use Stirling's formula to maximize In An] neglecting terms of order In n 
and lower 

In An = n(ln Jo — i In 2) — in(ln n — 1) 

n ^ 1 12 _ (13.23) 

Thus the energy level i?„o = (no + from whose neighborhood most 

of the contribution to yp comes, is approximately equal to the energy 
of the classical oscillator that has the same amplitude. 


14. SPHERICALLY SYMMETRIC POTENTIALS IN THREE DIMENSIONS 


It is generally impossible to obtain analytic solutions of the three- 
dimensional wave equation (8.2) unless it can be separated into total 
differential equations in each of the three 
space coordinates. It has been shown^ 
that there are 11 coordinates systems 
in which the free-particle wave equa- 
tion [Eq. (8.2) with 7 = 0] can be 
separated. One of the most important 
of these is the spherical polar-cooixlinate 
system, in terms of which the rectangular 
coordinates are given by (see Fig. 12) 


a; = r sin 0 cos 0 
2/ = r sin 0 sin 
z = r cos 6 



Fiq. 12. llolation betwoon roc- 
tangulur and nplicrioiil polar coordi- 
natoH of a point P. 


If the potential en(M‘gy is spheric, ally symmetric, so that 7(r) = V{r) is a 
function only of the magnitude r of r measured from some origin, the wave 
equation can always be separated in spherical coordinates. Many prob- 
lems of physical interest can be represented exactly or approximately 
in terms of spherically symmetric potentials of various shapes. In 
this section we elTect the separation, and solve the resulting total dif- 
^ L. P. Eiscnhiirt, Rev., 46, 428 (1934). See also li. Pauling and E. B. Wilson, 
Jr., “Introduction to (iuantum Mechanic, b,” Appendix IV (McCIraw-Hill, N(iw York, 
1936). 
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ferential equations in B and 0. The next two sections deal with the 
solution of the radial equation for particular forms of 7(r). 

Separation of the Wave Equation. The wave equation (8.2) with a 
spherically symmetric potential energy may be written in spherical 
coordinates 


2m [r*! dr \ dr) r* sin 6 dd ® dd) r'^ sin* 6 d<t>^\ “ 

+ V{r)u = Eu (14.1) 

We first separate the radial and the angular parts by substituting 

u{r,e,<t>) = R{r)Y{e,4>) 
into Eq. (14.1) and dividing through by u. 

- “ 7 

Since the left side of Eq. (14.2) depends only on r, and the right side 
depends only on 6 and 0, both sides must be equal to a constant that we 
call X. Thus Eq. (14.2) gives us a radial equation 

r=s(’-‘f) + t® - ’'Ml - r<l ® - 0 


and an angular equation 
1 d 


sin d dd 




+ 


dw 


sin* 6 d4>^ 


+ xy = 0 


(14.4) 


The angular equation (14.4) can be further separated by substituting 
r(fl,<#>) = 6(0)#((/>) into it and following the same procedure to obtain 


d4>^ 


+ v<f> = 0 


1 d 
sin 6 dd 




(14.5) 

(14.0) 


The <j) equation (14.5) can be solved at once; its general solution may be 
written 


$(<#)) = 

#((#>) = A + B<t), 


V 9^0 

V = 0 


(14.7) 
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The requirement of Sec. 8 that $((/») and d^/d^ be continuous through- 
out the domain 0 to 27r of </> demands that v be chosen equal to the square 
of an integer. We thus replace Eqs. (14.7) by 

(14.8) 

where now all physical meaningful solutions are included if m is allowed 
to be a positive or negative integei’ or zero;^ the multiplying constant is 
chosen equal to (27r)~i in order that $ be normalized to unity over the 
range of 

Legendre Polynomials. Unless V {r) is specified, the farthest we can 
carry our treatment is the solution of the 6 equation (14.6), where now 
V = It is convenient to substitute w = cos Q for and put 

0 ( 0 ) = P{w), 

when Eq. (14.G) becomes 

Since the domain of 0 is 0 to tt, the domain of ly is 1 to — * 1 . The procedure 
for solving lOcp (14.1)) is in many respects similar to the solution of the 
harmonic-oscillator wave cciuation presented in Sec. 1.3, and will not be 
given in detail here.- Sinc.e lOq. (14.9) is a second-order dilTerential 
equation, it has two linearly independent solutions. Except for particu- 
lar values of X, both of these are infinite at w — ±\, and in accordance 
with Sec. 8 ar(‘ not physically acceptable. If however X = i(i H- 1), 
where I is a i)osii.ive integer or zero, one of the solutions is finite at w = ±\ 
(the othei* is not); this finite solution has the form (1 — times a 

polynomial of order I — jml in Wj and has the parity of I — \m\. 

The physically acceptable solutions of Eq. (14.0) when m = 0 are 
called the Ijcgondrc polynoinmls Pi(w). Just as is the case with the Her- 
mite polynomials, their properties may be discussed in terms of a generat- 
ing function 

T{w,s) = (I - 2hw + 

CO 

= y Piiwy, .<!<]. (14.10) 

^ At the very sliRht risk of confuBion with the maflH of the particle, we make use of 
the cuatonuiry syinliol //j for llio quantum number aasociatoxl with the coordinate 

* For a complete diacuBBion of tluH equation, sec E. T. Whittaker and O. N. Watson, 
“A Course of Modern Analysis,” 4th cd., Chap. XV (Cambridge, Ivondon, 1936). 
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Differentiation of the generating function with respect to w and s leads to 
relations that are analogous to Eqs. (13.11) for the Hermite polynomials. 

(1 - w^)Pi = - iwPi + Wi-i ru in 

(l + l)Pj+i = (21 + l)wPi - IP^i 

where primes denote differentiation with respect to w. The lowest order 
differential equation involving only Pi that can be constructed from Eqs. 
(14.11) is easily seen to be Eq. (14.9) with X = Z(Z + 1) and m = 0. 

For m not necessarily equal to zero, Eq. (14.9) has physically accept- 
able solutions if X = Z(Z + 1) and \m\ g Z. These solutions, which ai^e 
called associated Legendre functions, are expressible in terms of the 
Legendre polynomials 

Pp(to) = (1 _ ^ (14.12) 

This can be shown by substitution of Eq. (14.12) into the equation that is 
obtained by differentiating [ml times the equation for Pi{w), The 
generating function for the associated Legendre functions is obtained by 
differentiating Eq. (14.10) \m\ times with respect to w and multiplying 
by (1 — to2)il«l, 

rr _ (2|m|)!(l 

“Ws; 2l“l(lml) !(1 - 2sw + 

00 

= y Pr{w)s^ (14.13) 

Spherical Harmonics. The angular part Yimid,<l)) of the complete 
wave function, which is a solution of Eq. (14.4) when X = Z(Z + 1), is 
called a spherical harmonic. It is apparent that 

= NimPTi^OH 0)$,„(0) (14.14) 

where ^m(</>) is given by Eq. (14.8), and Ni„i is the normalization constant 
for the associated Legendre function. 

The same proof that was given in Sec. 10 for the orthogonality of the 
energy eigenfunctions may be used to show that soliitioas of lOcp (14.4) 
corresponding to different eigenvalues X or Z are orthogonal. The 
eigenvalue Z is, however, (2Z + l)-fold degenerate, since there exist 
linearly independent solutions Yim{6,<l)) for this value of Z and all integer 
values of m between +Z and — Z. The choice of Eq. (14.8) for ^^, »{<!>) 
makes these degenerate eigenfunctions orthogonal. We have then that 
the integral 

jo Yt,n(d,4dYiw(&,<l>) sin Bded4> = YimYi'„,'dv)d<ji 
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vanishes unless I = V and m = m'. It is interesting to note that there is 
no more orthogonality present than is necessary to make this integral 
vanish when it should. Thus the part of the integral vanishes when 
m 9 ^ w! without regard for the I values; the d or w part of the integral 
vanishes only when I ^ V and |m| = jm'j, since for m 9 ^ w! the ortho- 
gonality is taken care of by the integration over 

The integral 

fix (14.15) 

can be evaluated in various ways, for example, by using the generating 
function (14.13) in a manner similar to that described in Sec. 13. As 
expected, the integral (14,15) vanishes unless I = I', when it has the 
value [2/(2Z + !)][(? + lm|)!/(Z — |m|)!]; thus Nimy which contains an 
arbitrary complex phase factor of unit magnitude, may be taken to be 
the reciprocal of the square root of this quantity. The normalized 
spherical harmonics are then 

YUe,<l>) = ^PTicoa (14.16) 

The first four spherical harnionics are 

" (5)' 

- (ly 

Parity. The concept of parity introduced in Sec. 9 can now be 
extended to three-dimensional problems of the type discussed in this 
section. Suppose that the position coordinate r is reflected through the 
origin so that r is replaced by — r; this corresponds to replacing x by — .r, 
2 / by — ?/, and z by — 2 ?, or to replacing dhyir— dy <l) by <i> + tt, and leaving 
r unchanged. It is clear that the only change in the wave ecpiation 
(14.1) is that u{ry 6 y<j)) is replaced by w(r,7r — Bycl> + tt), the rest of the 
equation being unatTected. Then the discussion of Sec. 9 shows that 
orthogonal linear combinations of degenerate eigenfunctions can be 
found that have definite parities, and that a nondegenerate eigenfunction 
must have a definite parity. 

The energy levels for a spherically symmetric potential are degenerate 
at least with respect to the ciuantum number for Z > 0. In this case, 
the degenerate eigenfunctions all have the same parity, which we now 
show to be the parity of 1. When r is reflected through the origin, the 
radial part li{r) of the solution is unchanged, the <i> part 4^(0) given by 
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Eq. (14.8) has the parity of \m\, and the 0 part Pf (cos 0) has the parity 
oil — \m\y since P^{w) is equal to an even part (1 — times a poly- 

nomial in w that has the parity of i — l^lwith respect to change in sign 
of w or cos 0. Thus and hence ^(r), has the parity of I, 

Angular Momentum. The radial wave equation (14.3) may be 
rewritten in a form that resembles the one-dimensional wave equation 
(8.5). If we put R{r) = x(0A7 equation for the modified radial 
wave function x niay be written 


Thus the radial motion is similar to the one-dimensional motion of a 
particle in a potential 


V(r) + 


l(l + IW 

2mT^ 


(14.18) 


The additional “potential energy” can be seen physically to be connected 
with the angular momentum in the following way. A classical particle 
that has angular momentum M about the axis through the origin per- 
pendicular to the plane of its orbit has the angular velocity co = M/mr^ 
when its radial distance from the origin is r. An inward force 


is required to keep the particle in this path; this “centripetal force” is 
supplied by the potential energy, and hence adds to the V{r) that appears 
in the radial motion an additional “centrifugal potential energy” 
This has exactly the form of the extra term in (1 4. 1 8) if we put 

M = [1(1 + 

The foregoing physical argument for identifying the quantum number 
I with the angular momentum of the particle can be put in quantitative 
form by finding the operators that correspond to the three componcuits 
of the angular momentum vector. Classically, we have that M = r X p> 
so that we take in quantum mechanics 

M. = yp.-zp,= 

= 2px - = —ih - ^ ^ (14.19) 

M. = xp,-yp.= -ih(x-l^-y^ 

Equations (14.19) can be transformed into spherical polar coordinates to 
give 
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M.. 

M, 


x = ih ^si 
y = ih ^ — 


sin <#> + cot 6 cos <l> 

do 


COS <t>-^ + cot 6 sin 

do 






(14.20) 


The operator that represents the square of the total angular momentum is 
then found from hlqs. (14.20) to be 


M« = Ml + Ml + Ml 

1 ^ a 

sin 0 t)0 ® 30/ 


= 


+ 


1 


sill- 0 30 


3^ 1 

50-J 


(14.21) 


Comparison of lOcis. (14.21) iind (14.4) shows that Fi,»(0,0) is an eigen- 
function of M- witli the eigtmvalue l{l + l)^-. 


M‘-^r,„.(0,0) = l{l + (14.22) 

In similar fasliion, it follows from the structure of Eq. (14.8) and the last 
of Eqs. (14.20) that 45^(0), and hence also Fin, (0,0), is an eigenfunction of 
Ma with the eigenvalue inh. 

M.YiM) = 7nhYUe,ct>) (14.23) 

Thus the separation of the wave cciuation in spherical polar coordi- 
nates results in wave funcillons that are eigenfunctions oi both the total 
angular momentum and th(^ (^olnponent of angular momentum along 
the polar axis, 'bhe (quantum number I that appears in lOq. (14.22) is 
called the azimullial or orlrUaL amjular-mmmmLuvi quanlwn number. The 
(quantum number m that appears in Va\. (14.23) is called the magnetic 
quantum number^ sine.e it is of imi)orliin(‘-c in (lie Ihoory of the Zeeman 
effect (see ycc. 30), which involves the e.omponent t)f angular momentum 
along the magnetic field {z axis). It should l>e noted that the wave 
equation cannot in general lie separatcul in this way and angular-momen- 
tum eigenfunctions obtained if the pol-(uitial energy VO') spherically 

symmetric. This corresponds to the classic^al result that the angular 
momentum is a constanl. of the moiiion only ior a (uml.ral field of force 
(which is des(;ril)al)le by a spheric*, ally symm(4/ri(; poi.ential). Iherc is, 
however, the charactcu’isl.ic^ dilTerenc^e bel,ween elassictal and quantum 
theory that all three componeni-s of M can be iircMUsely specified at once 
in the classical theory, whereas only Mg and M** can in general be pre- 
cisely specified at once in the cpuint^um theory, sinc^e F/„»(0,0) is not an 
eigenfunction of and My (excc'pl- for the case I = 0). This result can 
be related to the uncertainty principle (see Prob. 5, (3iap. I). The choice 
of the direction of the polar axis (hat distinguishes Mg from M^ and My 
is, of course, completely arbitrary; it corresponds to the arbitrariness of 
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the dhection of space quantization in the absence of external fields in 
the old quantum theory. 


16. THREE-DIMENSIONAL SQUARE WELL POTENTIAL 

We are now in a position to find the bound-state energy levels that 

correspond to particular choices of the 
potential energy 7(r) and of the angular- 
momentum quantum number Z, by solving 
the radial wave equation (14.3). As afimt 
example, we consider the square well poten- 
tial of finite depth, for which V{r) = — 7o, 
r < a, V{t) = 0, r > a, where Fo is positive 
(see Fig. 13). A spherical region of this 
type in which the potential is less than that 
of the surroundings serves to attract a par- 
ticle just as in the one-dimensional case 
considered in Sec. 9. 

Zero Angular Momentum. When I = 
0, it is easier to solve the wave equation in the form (14.17) than in the 
form (14.3). In this case, i?(r) = x(^)/2", and the wave equation is 



Eia. 13, Spherically symmet- 
rio square well potential of 
depth Vo and radius a. 


“ Is 55 " “ 


(15.1) 


The solution of Eqs. (15.1) is the same as that obtained for the finite 
potential step in Sec. 9, except for three points: first, the energy scale is 
lowered everywhere in the present problem by an amount Fo; second, the 
domain of r is 0 to + oo, in place of the domain — « to +00 for a:; and 
third, the boundary condition that the wave function not become infinite 
at a; = — «» is now replaced by the same condition at r = 0. 

From the discussion of Sec. 9, it is apparent that the solutions of 
Eqs. (15.1) are 


xif) = A sin or + B cos or, 
x(r) == Ce-^% 



r < a, 

^ - m 

r > a 


(15.2) 


where we are interested in bound-state energy levels for which B < 0. 
The requirement that i?(r) be finite at r = 0 demands that we set B = 0 
in the first of Eqs. (15.2). Thus the solution has the form of the odd 
parity solution of the one-dimensional problem. The energy levels are 
obtained by equating the two values of {l/x){dx/dr) at r = a (this is 
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equivalent to making {l/R){dR/dr) continuous there), and ai’e given by 
solving 

a cot aa = —jS (16,3) 

which is the same as Eq. (9.6). Then it follows from the discussion of 
Fig. 9 that there is no energy level unless Foa^ > 7rW/8m; there is one 
bound state if < Foa- ^ 97r-?i^/8m, etc. 

Interior Solutions for Arbitrary L For nonzero values of Z, it is more 
convenient to work with the original radial equation (14.3) than with the 
equation for x- H' we put p = a?', where a is defined in Eq. (16.2), the 
wave equation for r < a becomes 


,2dR, 


i(l + 1) 




(15.4) 


The strong resemblance between Eq. (16,4) and BessePs equation sug- 
gests that R(r) can be expressed in terms of Bessel functions. This is in 
fact the case; if Ave define the ‘‘spherical Bessel function^' ji{p) that is 
regular at p = 0 by^ 




(15.5) 


where J is an ordinary Bessel function of half-odd-integer order, it is 
easily verified that ji{p) satisfies Eq. (15.4). The definition (15.5) also 
applies (except for sign) to the relation between the “spherical Neumann 
function” ^^(p) that is singular at p = 0 and the corresponding singular 
(negative-order) solution of Bessel’s equation. 

It can be shown- that where I is a positive or negative integer 

or zero, is expressible as a sum of produ(d;S of sin p and cos p Avith poly- 
nomials of odd order in p“ K In particular, explicit expressions for the 
first three j’s and n’s are 


(15.(5) 



no{p) = 

COS P 

p 

cos p 

p 

7l,l(p) = 

cos P 
•> 

p- 




3 

-2 cos P, 

ihip) = 

Vo’ 


3 . 

^,sinp 


The leading terms for small p arc'* 


1 This definition and the properties of ilio ji and ?li are taken from P. M. Morse, 
“Vibration and Sound,” 2d od., pp. 316-317 (McGraw-Hill, Now York, 1948). 

®G. N. Watson, '‘Theory of Bessel Functions,” 2d cd., p, 62 (Macmillan, New 
York, 1946). 

® Equations (16.7) are useful approximations for p® somewhat leas than 4^ -|- 6 and 
2, respectively (G. N. Watson, op, city p. 44). 
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(15.7) 


and the leading terms in the asymptotic expansions are‘ 

jl(p) ^ - cos [p - i(Z + l)7r] 

p— > 00 p 

niip) > - sin [p — ^{l + l)ir] 

fy 


(15.8) 


Some properties of the j’s and «.’s are 

Jjo(p)/>®dp = ip“[iS(p) + no(p)ii(p)] 
inl{p)pHp = ip*K(p) - io(p)ni(p)] 

ni-i(p)ji(p) — nj(p)jj_i(p) = I > 0 

The following are properties of both the j’s and the n’s: 


it-i(p) +in.i(p) = — ^Jj(p), I > 0 

P 

^ij(p) = 21 + i ty'-i(p) — (^ + i)jVi(p)] 

d 

^ [p*-^yi(p)] = p^+Vi-i(p), I > 0 (15.10) 

^ [p“yi(p)] = — p-'yz+i(p) 

iji{p)dp = — joip) 

JjoMp^dp = p^ji(p) 

iji{p)p^dp = ip'^’b'Kp) — ii:-i(p)jf+i(p)] I > 0 

Since R{t) must be finite for r = 0, the desired solution for r < a is 

R(r) = Aji(ar) (15.11) 

Exterior Solutions for Arbitrary Z. The wave equation for r > a can 
be put in the form (15.4) if we redefine p to be ipr, Avhere /3 is given in Eq. 
(15.2). Since the domain of p does not now extend in to zero, there is 
^ Equations (16.8) arc useful approximations for p somewhat larger than ]l(l + 1) 
(G. N. Watson, op. dt, p. 199); however, the magnitudes (although not the phases) of 
ji and fii are given to good approximation by (15.8) if p is somewhat larger than 2, 
which is approximately the value of p for which the magnitude of i.s greatest. 
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no reason why ni cannot, appear in the solution. The linear combination 
of ji and ni to be selected will be determined by the asymptotic form, 
which must fall off exponentially for large r. This suggests that we define 
spherical Hankel functions 


WKp) = Ji(p) + iniip) 
h?Kp) = ji(p) — wi(p) 


( 15 . 12 ) 


which from Eqs. (15.8) have the asymptotic forms 

p — > p 
/,,p(p) > 1 

p—f DO P 


(16.13) 


It can be shown that the asymptotic expansions, of which Eqs. (15.13) 
give the loading terms, contain no terms for which the exponent has the 
opposite sign to that given. 

The desired solution for r > a is then 


U{r) = BhVK'lM = + ini(ipr)] (16.14) 

The first three of these functions are 


/,,V>(v:i3r) = ,:(! + (15.15) 


Energy Levels. 'I’ho cncirgy levels are obtained by requiring that 
(l/Ii){dlt/tlr) be (lontimiouH nt r = a. When this condition is applied to 
the interior solution (15.1 1) and the exterior solution (15.15) with I = 0, 
we obtain Mep (15.8). 'I'his may be written as 


^ cot J = -ij, 


e + T 


2m V o(i^ 


(15.16) 


where, as in Sec.. 1), wo. have put ^ = im and r; = pa. The same condi- 
tion appliiul to th(‘ solutions for / = I reduces, with the help of MqB. 
(15.6) and (15.15) to 


cot ? 1 1 I 1 

—z V- = “ I 

f f- n* 


e + 


2m Vj^ 


(15.17) 


Equations (15.17) may be solved numericially or graphically, by the 
methods indic.at(Kl for the solution of Eqs. (15.16) in Sec. 9. In general, 



QUANTUM MECHANICS 


80 


[Chap. IV 


there is no degeneracy between the eigenvalues obtained from the solution 
of (equations like (15.16) and (16.17) for various values of I 

It is easy to see how many energy levels Eqs. (15.17) give for various 
values of without going through the numerical work. A new level 
appears whenever rt is zero or cot { is infinite. This occurs at J = t, 
27 r, . . . . Thus there is no energy level with Z = 1 when 


7oa2 ^ 


2m 


there is one bound state with Z = 1 if Tr%^f2m < Toct® ^ 27rW/7n, etc. 

The smallest value of Voa^ for which there exists a bound state with 
Z = 1, is greater than the corresponding value of Vod^ for Z = 0; this is 
reasonable from a physical point of view. The interpretation in Sec. 14 
of the Z term in the radial wave equation as an additional potential energy, 
which corresponds to the repulsive “centrifugal force,” suggests that a 
particle possessing angular momentum requires a stronger attractive 
potential to bind it than a particle with no angular momentum. Indeed, 
it turns out that the minimum square well potential “strength” Voc’^ 
required to bind a particle of orbital angular-momentum quantum num- 
ber Z increases monotonically with increasing 


16. THE HYDROGEN ATOM 

The potential energy 7(r) = which represents the attractive 

Coulomb interaction between an atomic nucleus of charge +Ze and an 
electron of charge —e, provides another wave equation that can be 
solved analytically. This problem is of direct physical interest, since 
apart from relativistic effects (see Chap. XII), the calculated energy 
eigenvalues are in agreement with the observed eneigy levels of the hydro- 
gen atom (Z = 1), the singly charged helium ion (Z = 2), etc. 

Reduced Mass. The Schrodinger wave equation developed in Sec. 6 
describes the motion of a single particle in an external field of force. 
Now, however, we are interested in the motion of two particles (nucleus 
and electron) that are attracted to each other by a force that depends 
only on the distance between them. The form of the wave equation to 
be used for two particles is suggested by the extension of the w'ave 
equation from one to three dimensions that was given in Sec. 6. This 
extension involved making the wave function depend on the three 
rectangular coordinates Xj y, and z instead of just on a;, and introducing 

1 It can be shown that bound states appear with zero energy for a particular I 
value when [{\/Ti){dR/dr)]r^ = -0 4- l)/a; for J > 0 this is equivalent to the 
condition « 0 where now t « (27nToaV^*)*. 
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the momenta corresponding to the new coordinates as they appear in the 
classical expression for the energy. 

A similar extension from three to six rectangular coordinates leads 
directly to the Schrodinger wave equation for two particles of masses 
TUi and m 2 : 


^2 / Q2 Q2 \ 1 


where the potential energy is assumed to depend in an arbitrary manner 
on all six coordinates and the time. If now the potential energy depends 
only on the relative cooi’dinates, so that V = V{xi — 0:2, 2/1 “ 2/2, 2^1 22), 

an important simplification can be made. We define relative coordinates 
and coordinates of the center of mass X^Y^Z by 


^ ‘^ 2 ; y 2/1 2 / 2 j 2 / Zi Z 2 2^ 

MX = miXi + m2X2, MY = rriiyi + m22/2, MZ = miZi + in2Z2 ^ 

Here, M = vi\ + mo is the total mass of the system. Equation (16.1) 
can be rewritten in I crms of i.bc new coordinates 


ih 


[ 2M VaA'2 ^ ^ ozy 


2jLt \dx'^ dy'^ dz^ 


Ji) + nw)]’A (16.3) 


where 


7nim.2 
mi + nh 


(16.4) 


is called the reduced mass. 

Two separations of the wave eciuation ( 1 6.3) can now be made. First, 
the time depcndciKJc can Ix^ soparalxMl out, as in Sec. 8; and second, a 
separation (?an be made into a product, of functions of the relative coordi- 
nates and center-of-mass coordinates. The process is straightforward 
and simple, and results in 

^{x,y,z,X,Y,Z,t) - u{.r,y,z)U{X,Y,Z)e ^ 

— — Vhi + Vti = Eu (16.5) 

2/1 


]}l 

2M 


VHJ = WU 


where the v“ operators in the second and third equations imply diffei- 
ontiation with respect to the relative and center-of-mass coordinates, 
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respectively. The second of Eqs. (16.6) describes the relative motion of 
the two particles, and is the same as the equation for the motion of a 
particle that has the reduced mass /x in an external potential energy F. 
The third of Eqs. (16.6) tells us that the center of mass of the system of 
two particles moves like a free particle of mass M. 

In the hydrogen-atom problem, we shall be interested in the energy 
levels E associated with the relative motion. In this case, the reduced 
mass II is only slightly smaller than the electronic mass, since atomic 
nuclei are far more massive than electrons. 

Asymptotic Behavior- The separation of the relative motion in 
spherical coordinates is made as in Sec. 14. The radial equation that 
corresponds to the angular-momentum quantum number I is then 


2n dr \ dr / 



l(l + l)y 

2fir^ 


R = ER 


(16.6) 


where E < 0 for a bound state. We follow the polynomial method used 
in the treatment of the harmonic-oscillator equation that was given in 
Sec. 13, and first attempt to rewrite Eq. (16.6) in dimensionless form by 
introducing a dimensionless independent variable p = or. Unlike Eq. 
(13.1), however, where the leading term for large x was the potential 
energy term the leading term in Eq. (16.6) for large r is the eigen- 

value term E. We therefore choose a so that the E term becomes a 
fixed number; this makes the dominant asymptotic behavior of the solu- 
tion independent of the eigenvalue. We rewrite Eq. (16.6) as 


J. ^ / 2 . /X _ 1 

p^dpY W \P 4 




(16.7) 


where the particular choice of the number i for the eigenvalue term is 
arbitrary but convenient for the following development. Comparison 
of Eqs. (16.6) and (16.7) shows that 



As with the harmonic-oscillator equation, we first find the dominant 
behavior of R(p) in the asymptotic region p— oo. For sufficiently 
large p, it is apparent that R(p) = satisfies Eq. (16.7) so far as the 

leading terms (which are of order R) are concerned, when n has any 
finite value. This suggests that we look for an exact solution of Eq. 
(16.7) of the form 


R(p) = F(p)e-lp 


(16.9) 
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where F{p) is a polynomial of finite order in p. Substitution of Eq. (16.9) 
into Eq. (16.7) gives as the equation for F{p) 



+ 1 ) 

P^ 


F = 0 


(16.10) 


where primes denote differentiation with respect to p. 

Energy Levels. We now find a solution for F in the form 


F(p) = p’(ao + aip + ojip® + • • • ) 

= p’L{p), ao ^ 0, s 0 (16.11) 

This is necessarily finite for p = 0. Substitution of Eq. (16.11) into Eq. 
(16.10) gives as the ecpiation for L 

p^L" + p[2(s + 1) - p]/y + [p(X - s - 1) + s(s + 1) - l{l + l)]i^ = 0 

If p is set eciual to zero in this equation, it follows from the form of L 
implied by Etp (16.11) that s(s + 1) - l{l + 1) = 0. This quadratic 
equation in s hius two roots: .s = I and s = —(1 + 1)- The boundary 
condition that lt(,p) be finite at p = 0 requires that we choose s = 1. 
The equation forL then becomes 

pL" + [2(1 + 1) - p]L' + (X - 1 - 1)L = 0 (16.12) 

Equation (16.12) can bo solved by substituting in a power series of the 
form indicated by I‘i<|. (Ifi.ll). d'hc rccnirsion relation between the 
coefficients of successive terms of th(! series is readily scon to be 


r + 1 + 1 — X 

“■’+‘ “ (Tnirr+^TT^ ““ 


(16.13) 


If the series does not terininiite, its doiniruint nsymptotie behavior can 
be inferred from the eoeffieaents ol' its hi^ 2 ;h terms: 


1-1 1 

dp > 00 

This ratio is the same as that of the series lor p’‘i'<‘ with any finite value oi 
n. E(iuations (16.9) and (16.11) show that this behavior for L violates 
the boundary condition on It for large p. 

Thus the s(!ries for L must terminate. If the highest power of p in 
L is p”'(n' ^ 0), we must choose X (siual to a positive! integer «,* such that 

X = n = u' + 1 + 1 (16.14) 


' The resiiU. tlini the allowed values of X are integers, rather than irmltiplcs of 
integers, derives from the e.hoiee of i for the eigenvalue term in the dimensionless 
radial wave equation (1G.7). 
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n' is called the radial quantum number and n the total quantum number. 
Since n' and I can take on positive integer or zero values, n can have the 
values 1, 2, ... . The energy eigenvalues are given by Eq. (16.8) 

in agreement with the old quantum theory and with experiment. Unlike 
the square well potential problem considered in Sec. 15, the Coulomb 
field problem gives rise to an infinite number of discrete energy levels 
extending from — /AZ®e^/2A^ up to zero, for any finite value of Z. This is 
due to the slow decrease in magnitude of the Coulomb potential at large r, 
Laguerre Polynomials. The physically acceptable solutions of Eq. 
(16.12) with \ = n may be expressed in terms of the Laguerre 'poly- 
nomials ifl(p), which can be defined in terms of a generating function 

- .IL. 

p 1—8 

i7(M) = ^ 

eo 

= S<1 (16.16) 

Differentiation of the generating function with respect to p and s leads to 
relations that are analogous to Eqs. (13,11) for the Hermite polynomials 
and (14.11) for the Legendre polynomials 


L\ — — —qLq-.\ 

Lg+i = (20^ + 1 — p)Lq — q^Lq^i 


(16.17) 


The lowest order differential equation involving only Lq that can be 
constructed from Eqs. (16.17) is easily seen to be 


P^q + (1 — p)L'^ + qLq = 0 (16.18) 

Equation (16.18) resembles Eq. (16.12) but is not quite the same. We 
define the associated Laguerre polynomial 

i?(p) =|^pi'«(p) (16.19) 

on differentiating Eq. (16.18) p times, it is seen that L5(p) satisfies 

pLj" 4- (p + 1 _ p)m + (g. - p)L» = 0 (16.20) 

Comparison of Eq. (16.12) with X = n and Eq. (16.20) shows that the 
desired polynomial solutions are the associated Laguerre polynomials 
which are of order (n + 0 — (2Z + 1) = n — I — 1 in agreement 
with Eq. (16.14). 
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Differentiation of Eq. (16.16) p times with respect to p gives the 
generating function for the associated Laguerre polynomials, 


UM 


(1 - L/ gl 


(16.21) 


The following explicit expression may be verified by substituting it 
into Eq. (16.21) with n + I = q and 21 + 1 = p, and interchanging the 
order of the two summations: 


LVtKp) 


n-I-1 


l)t+l 


lin + l)\]Y 


*=n 


/f)l(2Z + 1 + k)\k\ 


(16.22) 


Hydrogen-atom Wave Functions. The radial wave function is of the 
form er^i‘ffL^^{p). The normalization constant may be found by using 
the generating function to evaluate the integral 




e-Y[LVfiKp)]Vdp = 


(16.23) 


Thus the normalized energy eigenfunctions for the hydrogen atom are 


an — 


fxe- 


2Z 

p = ■ — r 
nao 


where Yu,Xd^<t>) is the normalized spherical harmonic given in Eq. (14.16); 
rtn is the radius of the first (circular) Bohr orbit for hydrogen (Z = 1) 
in the old (piantum theory. The energy levels (16.15) may be written 

ZV 


En = - 


2a{i7i^ 


The first three radial functions, which arc found from Eqs. (16.22) 
and (16.24), are 


i()(r) 


R2i){r) 


R2i{r) 



2ao 
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A much more complete set of these functions, with graphs of some of 
them, is given by Pauling and Wilson.^ 

It is interesting to note that each of the eigenfunctions for which 
I = 0 has a discontinuous gradient at r = 0, since dR^^/dr 9 ^ 0 there 
and Foo is independent of d and This is a consequence of the infinite 
potential energy at that point, as can be shown by means of a limiting 
process similar to that used in Sec. 8 to derive the boundary conditions 
at a perfectly rigid waJl. 

Degeneracy. The energy eigenvalues (16.15) depend only on n, and 
so are degenerate with respect to both I and m. Thus for each value of 
71 , 1 can vary from 0 to n — 1, and for each of these I values, m can vary 
from — Z to +Z. The total degeneracy of the energy level is then 

V {21 + 1) = 2 + « = n« 

It follows from the discussion of Sec. 14 that the degeneracy with 
respect to m is characteristic of any central force field, for which V 
depends only on the radial distance r from some point. The I ilegener- 
acy, however, is characteristic of the Coulomb field, as distinguished from 
most other central force fields. In some problems, such as the motion of 
the valence electron of an alkali atom, the potential energy of the electron 
is central, but only approximately of the Coulomb form. This pre- 
vents the n energy levels that have the same total (luiintum number 71 
and different I from being coincident, so that the nth hydrogen-like level 
splits up into n distinct levels. If also some external field (such as a 
magnetic field) that destroys the spherical symmetry is imposed, the 
(21 + l)-fold 771 degeneracy disappears, and the nth hydrogen-like level 
is split up into distinct levels. 

The existence of degenerate energy eigenvalues means that linear 
combinations of the corresponding eigenfunctions are solutions of the 
wave equation with the same energy. In the case of (he m (legeneracy, 
such linear combinations of the spherical harmonics can be 

found that correspond to a new choice of the polar axis. It is reasonable 
to expect that linear combinations of the degenerate hydi'ogen-atom 
eigenfunctions that have the same n and different I exist that also corre- 
spond to some new choice of the coordinates. This is, in fact, the case, 
since it turns out that the hydrogen-atom wave equation can be separated 
in parabolic coordinates. In general, degeneracy will occur whenever 
the wave equation can be solved in more than one way (in different 


^ Pauling and Wilson, op. cit., Sec. 21. 
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coordinate systems, or in a single coordinate system oriented in dif- 
ferent ways), since if there were no degeneracy the wave functions 
obtained in the different coordinate systems would have to be identical 
except for a multiplying constant, and that is usually not possible. For a 
general central field, an exception occurs Avhen I = 0, since then the wave 
function is spherically symmetric and has the same form for all orienta- 
tions of the polar axis, so that there is no degeneracy. A similar excep- 
tion occurs in the hydrogen atom problem when n = 1, in which case 
it turns out that the solutions obtained by spherical and parabolic 
separation of the wave cciuation are identical. 

Separation in Parabolic Coordinates. The parabolic coordinates 
are given in terms of the spherical polar coordinates by the relations 

^ r — z = r{l — cos 0) 

^ = r + 2 = r(l + cos 0) (16.25) 

0 = <^ 


The surfaces of constant ^ arc a set of confocal paraboloids of revolution 
about the z or polar axis, with focus at the origin, that open in the direc- 
tion of positive z qy 6 = 0. The surfaces of constant rj are a similar set 
of confocal paraboloids that open in the direction of negative 2 or 0 = tt. 
The surfaces of const, ant 0 are the same as in the spherical coordinate 
system: planes through the polar axis. 

The wav(^ (Mpialion for the hydrogen atom in parabolic coordinates is 



E <0 (16.26) 


The separation Is accomplished by substil.uting 

into K{\. (16.26) and dividing through by //; the 0 part of the ecpiation 
separates at oncc^: 


^ + (iV 



+ V) 


2fi\E\^r} 


1 


( 16 . 27 ) 


Since the left side of Kq. (16.27) depends only on ^ a.nd rj, and the right, 
side only on 0, both sides must be eciual to a const ant that, we call in 
accordance with the discussion of Sec. 14, t,his gives normalized 0 solu- 
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tions that are the same as (14.8) 


#„((^) = m = 0,±1,±2, . . . (16.28) 

The rest of Eq. (16.27) can be separated into | and i? parts: 

1 d ( ydf\ _ n\E\ , ^iZe^ 

SdkVdi) 4e 

where the separation constant v is to be determined by the boundary 
conditions. Thus the equations for / and g are 


d 

dH 


ydO \4{ ^ 2A* ^ 

d / dg\ ii\E\ri \ 


(16.30) 


Since these two equations are of the same form, and differ only in their 
constant terms, it is sufficient to solve one of them. 

Energy Levels. The first of Eqs. (16.30) may be solved by the 
method used to solve (16.6). The substitution f = af puts it into the, 
dimensionless form 


f<irV«/ xr 4 a'J' 

if we choose the parameters a and Xi to be given by 



(16.31) 


(16.32) 


The second of Eqs. (16.30) is also of the form (10.31) if we put f = ar? 
with <x given by (16.32); Xi is replaced by 


Xj = ^ (16.33) 

We now treat Eq. (16.31) as we did (16.7). The asymptotic behavior 
is dominated by the factor f, where we must take the minus sign in 
the exponent. The series that multiplies this starts with a terra f*, 
where it is readily shown that s = ±im. We therefore substitute 

/(f) = c-»?f*WL(f) (16.34) 

into (16.31) and obtain as the equation for L 

fL" + (|m| + 1 - f)i' + [Xi - i(|m| + 1)]L = 0 (16.35) 
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As with Eq. (16.12), the nonterminating solutions for L cause the wave 
function (16.34) to becomes infinite for large f. The terminating solu- 
tions are the associated Laguerre polynomials; comparison of Eqs. (16.20) 
and (16.36) shows that they are LSj^J^,^i(f), where 

= Xi - i(|m| + 1) (16.36) 


is a positive integer or zero. 

In similar fashion, the solution of the 7\ equation shows that the 
number 

ns = Xa ~ i(|ml + 1) (16.37) 

is a positive integer or zero. From Eqs. (16.36) and (16.37) we obtain 
Xi Xa = ni + Ha ”1” |ni| -h 1 = n (16.38) 


where n is a nonzero positive integer. The energy levels are given by 
combining Eqs. (16.32), (16.33), and (16.38): 



2ju 




in agreement with Eq. (16.15). The energy level En is degenerate, since 
according to (16.38) there are various ways in which the three quantum 
numbers ni, na, and m can be combined to make up n. For m = 0, there 
are n ways of choosing ui and na; for |m| > 0, there are two ways of 
choosing m ( = ±\'m\)j and n — \m\ ways of choosing ni and na. Thus the 
total degeneracy of the energy level En is 

n-l 

n 2 y (n “ ]m|) = n + 2 

in agreement with the earlier result. 

Wave Functions. It is clear from the foregoing discussion that the 
unnormalized hydrogen-atom wave functions in parabolic coordinates are 

IxZe.^ 

“ AHn. + n^ + H + 1) 


n{n — 1) — 


n(n 


1)1 

— ^ 


For a particular energy level and magnetic quantum numberm(n > \m\), 
the parabolic quantum numbers rii and can be chosen such that 
yii + n 2 == n — |m| — 1; that is, in n — |m| different ways. Similarly, 
for given n and m, the azimuthal quantum number I in the spherical 
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solution can be chosen such that |m| g Z ^ ti — 1, and so also in n — \m\ 
different ways. Thus the n — \m\ products of the f and rf functions are 
linear combinations of the n — |m| products of the r and 0 functions. 

The ground-state energy level provides a particularly simple illus- 
tration of the connection between the parabolic and spherical solutions. 
].n this case, ni — 712 — m = 0, and the parabolic solution is simply 

e . Also, n = 1, Z = ?n = 0, and the spherical solution is e . 

It is apparent from Eq. (16.25) that these two solutions are identical. 


Problems 


1. Apply the Bohr-Sommerfeld quantization rules (see Sec. 2) to the determina- 
tion of the energy levels of a harmonic oscillator and of the circular orbits in a hydrogen 
atom. Compare with the results obtained in this chapter. 

2. What is the order of magnitude of the spread of quantum numbers and energies 
of the states that contribute significantly to the oscillating-wave-packet solution for 
the harmonic oscillator? 

3. Use the generating function for the Hermite polynomials to evaluate 



iln(x)x^Un(x)dx, 


where the u^b are normalized harmonic-oscillator wave functions. 

4. Use the generating function for the Legendre polynomials to evaluate 


5. Obtain an approximate analytic expression for the energy level in a square well 
potential (I — 0) when 7oa® is slightly greater thsLmr^h^/8m. 

6 . Show that for a square well potential the values of Voa^ that just bind new 

energy levels with an Z value greater than zero are given by where the numbers 

z are the nonvanishing solutions of the equation = 0 (see footnote 1, page 80). 

7. Assume that the interaction between the neutron and the proton that make up 
a deuteron can be represented by a square well potential with a = 2.80 X 10“i3 cm. 
If the lowest (Z =* 0) energy level of the system is —2.18 Mev (million electron-volts), 
calculate Vo in Mev to three significant figures. How does the answer compare with 
that which would be obtained from the approximate formula derived in Prob. 5? 

r 

8. Consider Eq. (14.17) with I = 0 and V(r) = «. (numso variiihlos 

^ 

from r to 2 ! = e and show that Bessel’s equation results. What boundary (Condi- 
tions are to be imposed on z, and how can these be used to dotcrmincc tlic (uicrgy levels? 
What is the lower limit to Fo for which a bound state exists? 

9. Find expressions for the eigenfunctions and energy levels of a particle in a 
two-dimensional circular box that has perfectly rigid walls. 

10. It is shown in Sec. 9 that a one-dimensional square well potential has a bound 
state for any positive V oa®, and in Sec. 16 that a three-dimensional square well poten- 
tial has a bound state only for Vaa^ > ir»AV8m. What is the analogous situation for a 
two-dimensional square well potential? What, if any, is the physical significance of 
these results? 
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11. The Schrodinger equation for a rigid body that is constrained to rotate about a 
fixed axis and that has a moment of inertia I about this axis is 

at " 21 d<f^^ 

where is a function of the time t and of the angle of rotation <fi about the axis 

What boundary conditions must bo applied to the solutions of this equation? Find 
the normalized energy eigenfunctions and eigenvalues. Is there any degeneracy? 

12. Fmd the energy levels of a three-dimensional isotropic harmonic oscillator 

(y(?*) =» by solving the wave equation in cartesian coordinates. What is the 

degeneracy of each level? Show that this equation can also be separated in spherical 
and in cylindrical coordinates. 

13. Show that the expectation value of the potential energy of an electron in the 
nth quantum state of a hydrogen atom is — From this result, find the 
expectation value of the kinetic energy. 

14. Find the normalized hydrogen-atom wave functions in parabolic coordinates 

for 71 = 2, = 0. Express them as linear combinations of the corresponding wave 

functions in spherical coordinates. 

16. Discuss the parities, if any, of the hydrogen-atom wave functions in parabolic 
coordinates. 
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solution can be chosen such that lm| g Z g 71 — 1, and so also in n — \m\ 
different ways. Thus the n — \m\ products of the f and 77 functions are 
linear combinations of the n. — jm] products of the r and 6 functions. 

The ground-state energy level provides a particularly simple illus- 
tration of the connection between the parabolic and spherical solutions. 
In this case, ni = n 2 = m = 0, and the parabolic solution is simply 

MZeH$+v) 

e . Also, n = 1 , Z = m = 0, and the spherical solution is e . 

It is apparent from Eq. (16.25) that these two solutions are identical. 


Problems 

1. Apply the Bohr-Sommerfold quantization rules (see Sec. 2) to the determina- 
tion of the energy levels of a harmonic oscillator and of the circular orbits in a hydrogen 
atom. Compare with the results obtained in this chapter. 

2. What is the order of magnitude of the spread of quantum numbers and energies 
of the states that contribute significantly to the oscillating-wave-packet solution for 
the harmonic oscillator? 

3. Use the generating function for the Hermite polynomials to evaluate 




U7i(x)xhi„i{x)dx, 


where the w’s are normalized harmonic-oscillator wave functions, 

4. Use the generating fimction for the Legendre polynomials to evaluate 




Pi {w)Pif(;w)dw, 


6. Obtain an approximate analytic expression for the energy level in a square well 
potential (I = 0) when Voa^ is slightly greater than Tr%^/8ni. 

0. Show that for a square well potential the values of Van® that just bind new 
energy levels with an I value greater than zero are given by h^z^l2rn, whore the numbers 
z are the nonvanishing solutions of the equation = 0 (see footnote 1, page 80). 

7. Assume that the interaction between the neutron and the proton that make up 
a deuteron can be represented by a square well potential with a — 2.80 X lO^^® cm. 
If the lowest (Z = 0) energy level of the system is “2.18 Mev (million electron-volts), 
calculate Fo in Mev to three significant figures. How does the answer compare with 
that which would be obtained from the approximate formula derived in Prob. 5? 

_ r 

8. Consider Eq. (14.17) with I *= 0 and F(r) - ~VqC ('Ihangc variables 

T 

from r to z e and show that Bessers equation results. What boundary c-ondi- 
tions are to be imposed on z, and how can these be used to determine the eiu'rgy lev(ds? 
What is the lower limit to Fo for which a bound state exists? 

9. Find expressions for the eigenfunctions and energy levels of a particle in a 
two-dimensional circular box that has perfectly rigid walls. 

10. It is shown in Sec. 9 that a one-dimensionaJ square well potential has a bound 
state for any positive Fo«®, and in Sec. 15 that a three-dimensional square well poten- 
tial has a bound state only for Foa* > T^h^/Sm, What is the analogous situation for a 
two-dimensional square well potential? What, if any, is the physical significance of 
these results? 
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I!- SulirOdiuKur equation for a rigid body that is constrained to rotate about a 
lixod axw imd that has a moment of inertia I about this axis is 

at ■" 

wluTt^ is a fiincl.iou of the time t and of the angle of rotation 4> about the axis 

What iuHuuliiry (‘.ontlitious immt be applied to the solutions of this equation? Find 
tla* norinali/AMl ouorgy eigiaifimetions and eigenvalues. Is there any degeneracy? 

12. h'iud the (Maa’gy levi'ls of a three-dimensional isotropic harmonic oscillator 

IK''")* solving t.h(‘ wave equation in cartesian coordinates. What is the 
th‘gruor{u*y of i*ac.U Icvt'l? Show that this equation can also be separated in spherical 
and in i'ylindrictLl coordiniitcs. 

13. Show that th(^ (^xj>cM',t.atioii value of the potential energy of an electron in the 
/Uh iinantum stato of a hydrogen atom is — 2V/aon^ From this result, find the 
ax|H*rlMtion value cjf the kiuotie energy. 

14. 1-i !id the uornnili/icd hydrogen-atom wave functions in parabolic coordinates 
for a ■ 2, m - 0. lOxiinw them as linear combinations of the corresponding wave 
funetioiiH in HpluM’ieal coordiiuitea. 

IB. I >i.sens.M the. parities, if any, of the hydrogen-atom wave functions in parabolic 
e<iordinat.<‘.s. 



CHAPTER V 

CONTINUOUS EIGENVALUES : COLLISION THEORY 


Problems for which the energy eigenvalues are continuously dis- 
tributed usually arise in connection with the collision of a particle with a 
force field. The method of approach is different from that employed in 
the preceding chapter. There the boundary conditions at great distances 
were used to determine the discrete energy levels of the particle. In a 
collision problem, the energy is specified in advance, and the behavior of 
the wave function at great distances is found in terms of it. This 
asymptotic behavior can then be related to the amount of scattering of 
the particle by the force field. 

As in Chap. IV, the relatively few exact solutions that are obtained 
here are of wider application than might at first seem to be the case, 
since they can serve as foundations for approximate calculations on more 
complicated systems. It is interesting to note that the study of collisions 
is particularly important in connection with atomic nuclei (see Sec. 41) 
where relatively little information can be obtained in other ways. 


17. ONE-DIMENSIONAL SQUARE POTENTIAL BARRIER 
We consider first the one-dimensional collision of a particle with the 

square potential barrier V{x) shown in 
Fig. 14. In this problem we are inter- 
ested in a particle that approaches from 
the region of negative x and is reflected 
or transmitted by the barrier. In the 
corresponding classical problem, the 
particle is always reflected if its energy 
is less than that of the top of the barrier, 
and always transmitted if its energy is 
greater. We shall see that in the 
quantum problem, both reflection and 
transmission occur with finite proba- 
bility for most energies of the particle. Because of the lack of symmetry 
between positive and negative x that is introduced from the beginning, 
it is disadvantageous to deal with solutions that have definite parities, 
and so there is no reason for making V {x) symmetrical about a; = 0, as 



Fig. 14. One-dimensional square po- 
tential barrier of height Vo and thick- 
nees a. 
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was done in Sec. 9. We assume, therefore, that V{x) == 0 for a; < 0 and 
ic > a, and V^x) = To for 0 < a; < a, where Fo is positive. 

Asymptotic Behavior. We are interested in representing a particle 
that approaches from the left with energy E > Oj and may be turned back 
by the potential barrier or penetrate through it. Thus the asymptotic 
behavior (in the regions where V (x) = 0) is as follows: for a; < 0, we want 
the wave function to represent a particle moving to the left (reflected 
particle) as well as to the right (incident particle) ; f or rc > a, we want the 
wave function to represent only a particle moving to the right (trans- 
mitted particle). 

A particle in a force-free region that is moving in a definite direction 
with a definite energy necessarily has a definite momentum, and hence 
can be represented by a one-dimensional momentum eigenfunction 

ipx 

w(x) oc e ^ if the particle is moving in the positive x direction with the 

ipx 

momentum p, and u{x) ^ e * if the particle is moving in the negative x 
direction with the same energy. Thus since the wave equation in the 
regions where V(x) = 0 is 


2m dx’^ 


Eu 


our asymptotic solutions are 


u{x) = Cc'**, a: ^ o 

where fr = p/h = (2mB/ft^)Ms the magnitude of the propagation number. 
The solutions (17.1) arc appropriate for the force-free regions that are 
external to any scattering potential, whether or not it has the simple form 
shown in Fig. 14. 

Normalization. The physical meaning of the coofficienta A, B, and C 
can be inferred by substituting (17.1) into the one-dimensional form of 
the probability current density given by Eq. (7.3). 


S{x) - v(\A[^ - IBl*^), .r < 0 
8{x) = X > a 


(17.2) 


where v = Kk/m is the speed of a particle with propagation number k. 
Since these expressions arc independent of Xy the discussion of Sec. 7 
shows that they may be ini;Crproted as the neb flux (positive to the right) 
in the two regions. This interpretation is consistent with the statement 
above that A, B, and C are the amplitudes of the incident, reflected, 
and transmitted wave functions, respectively. 

The absolute normalization of the wave functions (17.1) is unim- 
portant for this problem; this is because wc are interested only in the 
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ratios of (Bl^ and \C\^ to \A\^j which are respectively the reflection and 
transmission coefficients for the barrier. It is sometimes convenient, 
however, to normalize the incident wave function to unit flux; this cor- 
responds to taking A = 1/vK Such a normalization must not be inter- 
preted as indicating that u{x) represents more than one particle; rather 
it means that we choose a large enough number of systems [each described 
by u(x)] that are identical, independent, and nonoverlapping in the 
sense of Sec. 7, so that the total incident flux in all of them is unity. A 
more precise but sometimes less convenient normalization would assume 
a one-dimensional "box” of length L with periodic boundary conditions, 

and require that \u{x)\Hx = 1. 

Scattering Coefficients. The character of the solution inside the 
potential barrier depends on whether E is greater or less than Fo. Sup- 
pose first that B > Fq, so that we can define a propagation number 
inside the barrier: a. == [2m{E — Fo)/^^]^. Then the solution inside is 

u{x) = 0 g a; ^ a (17.3) 

The continuity of u and du/dx at x = 0 and x = required by the bound- 
ary conditions provides four relations between the five coefficients. We 
can eliminate F and G, and solve for the ratios B/A and C/A. 

B __ (A*2 - a:2)(l - 

C _ ^ ^ ^ 

A {k + ay ~ (A- ~ 

The absolute squares of the ratios (17.4) are the scattering (reflection and 
transmission) coefficients 



1 + 
1 + 


4/r^a^ 

(^2 _ 

(jy — aP-y sin^ aa 


4/[-2q,2 


AE{E - Fo) 
VI sin**^ aa 
^ . VI sin- aa 
, 4B(B - Fo) 


(17.5) 


It is readily verified from (17.5) that \B/A\^ -f- \C/A\^ = 1, as would be 
expected. 

Equations (17.5) show that the transmission coefficient approacihes 



mFoa^y 
2h:^ ) 


(17.6) 


when the particle energy approaches the energy of the top of the 
barrier {JB Fq). For increasing B {E > Fo), the transmission coefficient 
oscillates between a steadily increasing lower envelope and unity (see Fig. 



Sec. 17] CONTINUOUS EIGENVALUES: COLLISION THEORY 


95 


15). There is perfect transmission when aa = tt, 27r, . . . ; i.e., when- 
ever the barrier contains an integral number of half wave lengths.^ 
Interference phenomena of this type are well known in the transmission 
of light through thin refracting layers. 



Vo 

I’la. 16 . TranHinisHian cocllicicnt of u, .'<(|iukrc hnrrior ils a function of particle energy for 
= 8 . 

The reflection and tninsmission coefficients for 0 < E < Vo are most 
easily obtained by replacing a by in E(is. (17.4), where 


^ \2m.{V, - 

^ L A '' J ' 


The result for the transmission coc^fficient is 


a 

A 


VI sinh“ jSa ^ 


(17.7) 


This dec.reascs monotonic, ally from th(^ valium (17.0) iis E decreases 
below 7(). When » I, l.Iie transmission coefficient (17.7) becomes 
very small and is given approximately by 


107i/(7o - 

v! ~ 


E) 




(17.8) 


Figure 15 is a plot of the transmission coefficient computed from iO(is. 
(17.5) and (17,7) for a rather “opaciue^’ barrier: = 8. 


^Tliis (‘(T(‘(rt hIho ocr.urn wIkmi Fo < 0, in c-iisc. thn nqiijirn jiiirriiM’ hcM'oincs a 

square wnll. "I'lia HciiiUn'iiiji; (ux'dic.ic'nl.H an*, ^ivni by (17.5) if the hIkii of To is clianKt'*! 
there and in the expreKHien for a. 
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18. COLLISIONS IN THREE DIMENSIONS 

We are primarily concerned in this chapter with collisions in three 
dimensions, in which a particle collides with a fixed force field, or two 
particles collide with each other. It was shown in Sec, 16 that the prob- 
lem of the nonrelativistic motion of two particles, when the only forces 
present depend on their relative positions, can be broken up into two one- 
particle problems, of which one describes the motion of the particles 
relative to each other or to their center of mass, and the other describes 
the free motion of the center of mass. While the center of mass can be 
taken to be at rest in calculating the energy levels of the internal motion, 
as in Sec. 16, it has a definite motion in a collision that cannot be ignored 
in calculating the outcome of such an experiment. This is because the 
customary laboratory procedure consists in bombarding a particle that 
is initially at rest with another particle that carries the total energy 
jBo = + ^' of Eq. (16.5). Thus the energy E of the relative motion of 

the two particles is different from the bombarding energy Eq^ and the 
observed scattering depends on whether the struck particle or the center 
of mass is initially at rest. 

We call the coordinate system in which the bombarded particle is 
initially at rest the laboratory coordinate system and the coordinate system 
in which the center of mass of the two colliding particles is (initially 
and ahvays) at rest the center-of-mass coordinate system. It is evidently 
easier to calculate the result of a collision experiment in the center-of- 
mass system than in the laboratory system, since only 3 degrees of free- 
dom appear in the former as compared with 6 in the latter system. The 
collision process in the center-of-mass system may then be thought of as 
one in which a particle that has the reduced mass ix = + ma) 

of Eq. (16.4) and an initial velocity v collides with a fixed scattering 
center [see the discussion of Eq. (18.9) below]. The distribution in angle 
of the scattered particles will be affected by the transformation between 
the center-of-mass coordinate system, in which the calculations are made, 
and the laboratory coordinate system, in which the observations are 
made. 

Scattering Cross Section. The angular distribution of particles 
scattered by a fixed center of force or by other particles is conveniently 
described in terms of a scattering cross section. Suppose that we bom- 
bard a group of n particles or scattering centers with a parallel flux of N 
particles per unit area per unit time, and count the number of incident 
particles that emerge per unit time in a small solid angle Aojo centered 
about a direction that has polar angles and with respect to the 
bombarding direction as polar axis. This number will be proportional 
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to Ny n, and Acoo, provided that the flux is small enough so that there is 
no interference between bombarding particles and no appreciable dimi- 
nution of the bombarded particles by their recoil out of the target region, 
and provided also that the bombarded particles are far enough apart so 
that each collision process involves only one of them. 

Then the number of incident particles that emerge per unit time in 
Awo can be written 

?liVcrn(^^0;^o)^^o (18.1) 

where the proportionality factor cro(^Q,</>o) is called the differential scatter- 
ing cross section. Since (18.1) has the dimensions of reciprocal time, 
o‘o(^o,<<)o) has the dimensions of an area. o’o(^o,^o)Aa3o is equal to the cross- 
sectional area of the parallel incident beam that contains the number of 
particles scattered into Acoo by a single target particle or scattering center. 
The integral of <ro(0o,<#>o) over the sphere is called the total scattering cross 
section 

0*0 = /o'o(^o,0o)ri?coo (18.2) 

For the collision of a particle with a fixed scattering center, the 
definition (18.1) of differential scattering cross section is equally valid in 
the laboratory and center-of-mass coordinate systems, since a scattering 
center that is fixed has an infinite effective mass and so the center of 
mass of the system does not move. For a collision between two particles 
of finite mass, however, the differential cross section (18.1) applies in 
general only to the laboratory coordinate system and to the observation 
of the scattered incident particle. It does not describe the observation of 
the recoil bombarded particle in the laboratory system, although it is 
of course possible to obtain a differential cross section for the recoil 
particle from o•o(0o,<^>o). In the center-of-mass system the differential 
cross section o’(0,<^>) may be defined in analogy with (18.1), where again 
the scattered incident i)article is the one that is observed and the flux N 
of the incident particle is computed with respect to the bombarded 
particle, not the center of mass. Since in this coordinate system the two 
particles move in opposite directions away from each other after the 
collision, it is clear that the differential cross section for observation of 
the recoil bombarded particle in the direction 0,<^) is just aiw — Qj<l> + t). 

Relations between Angles in the Laboratory and Center-of-mass 
Systems. The relation between the differential cross sections and angles 
in the laboratory system and in the center-of-mass system can be found 
by translating the laboratory system in the direction of the incident 
particle with sufficient speed to bring the center of mass to rest. Figure 
I6a shows a particle of mass Wi and initial speed v striking a particle of 
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mass m2 that is initially at rest; the center of mass moves to the right with 
the speed v' = + m2), as may be seen from the conservation of 



m2V 




Fig. 10. (a) Laboratory coordinate syHtein, in wliirli the target particle of innse ms is 

initially at rest. (/;) Center-of-masH coordinate wyMtein, in whicli the center of maHS is 
initially and always at reHt. (c) Vector addition of volocnty of center of nnwis in laboratory 
system (y) to velocity of observed particle in center-of-niasH .syKtem (v") to rivo velocity 
observed in laboratory system (vj) ; if v” < w', cannot exceed the aiiRlc ,sin~* 


momentum. Thus in the center-of-mass system the particles of masses 
rrix and m2 approach the center of mass with speeds 


p" = 2; — if 


m^v 

(mi + m2)' 


and v\ respectively; they evidently recede from the center of mass after 
the collision with the same speeds [see Fig. 16(&)]. It follows from the 
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geometry of the situation that 6 and 0 are related to do and 4>q by 

v" cos 6 + = Vi cos 6q 

v'* sin d = Vi sin 0 o 
<#> = 00 


(18.3) 


From the first two of Eqs. (18.3) we obtain on elimination of Vi 


tan 00 


sin 6 

7 + cos 6^ 


v' __ mi 

^ m2 


(18.4) 


Equations (18.3) and (18.4) can be generalized by considering a col- 
lision (for example, a nuclear reaction) in which a particle of mass mi 
strikes a particle of mass m 2 initially at rest, and after the collision, 
particles of masses m 3 and m^ emerge, where mi + m 2 = ms + 

If also an amount of energy, Q, is converted from internal energy to 
kinetic energy of the emergent particles (Q is positive for exothermic and 
negative for endothermic collisions), and the particle of mass ma is 
observed, the first of Eqs. (18.4) is still valid. In this case 7 is still equal 
to the ratio of the speed of the center of mass in the laboratory system 
to the speed of the observed particle in the center-of-mass system. How- 
ever, 7 is no longer mijm-i, but can bo shown to be given by 


m E 

E + Q) 


(18.5) 


where 7^ = 7nitn2V-l2(mx + mo) is the energy initially associated with the 
relative motion in the (umter-of-mass system [see the discussion of Eq. 
(18.9) below]. 

Relation between Cross Sections. relation between the cross 

sections in tile laboratory and (^ent,(M-of-inass coordinate systems can be 
obtained from tluMr definitions, which imply that the same number of 
particles arc scattered into the dilTcrential solid angle r/wo about 00, 00 as 
are scattered into dco about 0 , 0 . 


o'o(0o,0n) sin 0i)d0od0o = cr(0,0) sin ddOdtj) (18.6) 


With the hell) of Ihe last of M(]S. (18. 3) and the first of J0(|s. (18.4), Eq. 
(18.6) gives 


O'o(0o,0n) — 


(1 + 7''" + 27 c.os 0) 


I L + 7 cos 0| 


- <7(0, 0 ) 


(18.7) 


where in general 7 is given by K(\. (18.5). It should be noted that the 
total cross section is the same for l)o|.h laboratory and cemter-of-mass 
systems, aiul also for ])olh outgoing ])n.r 1 icles, since i he i,oial number 
of collisions that f ake place is indej)endent of the mode of description of 
the process. 
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Dependence on y. For 7 < Ij Eq. (18.4) shows that ^0 increases 
monotonically from 0 to tt as ^ increases from 0 to t. For y = 1, da — 
and varies from 0 to ^ as 0 varies from 0 to tt; in this case 

O‘o(^0j^o) = 4 cos 0oO’(20O,^o), 

and no particles appear in the backward hemisphere in the laboratory sys- 
tem, For 7 > 1, ^0 first increases from 0 to a maximum value sin‘“^(l/7), 
which is less than as 6 increases from 0 to cos“^ ( — 1/7); ^0 then 
decreases to 0 as 0 increases further to tt. In this case o-o(0o,<Ao) is usually 
infinite at the maximum value of ^0, although this singularity gives a 
finite contribution to the total cross section; no particles appear beyond 
the maximum ^0 in the laboratory system. The two values of d that 
give rise to a particular value of ^0 between 0 and sin'^ (I/7) can be 
distinguished by the energy of the observed particle, which is greater 
for the smaller 9. 

This last case (7 > 1) is illustrated schematically in Fig. 16c. The 
resultant of the velocity v'' of the observed particle in the center-of-mass 
system and the velocity v' of the center of mass in the laboratory system 
gives the velocity Vi of the observed particle in the laboratory system. 
The locus of the terminal points of Vi when its origin is at the point 0 
is the circle of radius v". Thus when y" < y', the angle do of the resultant 
Vi with the bombarding direction cannot exceed the angle 



As the ratio 7 = v^/v^' decreases, the circle gets relatively larger and the 
angular range of Vi increases. 

The use of geometrical relationships in the foregoing discussion is 
valid in a quantum-mechanical system as well as in a classical system. 
This is because they are essentially relations between momentum vectors 
that are applied in the asymptotic region where the particles need not be 
precisely localized in space and hence can have definite momenta. 

It is interesting to note that the difference between laboratory and 
center-of-mass systems is negligible in the collisions of electrons with 
atoms, because of the large mass ratio of the colliding particles. In 
nuclear collisions, however, the difference between the two coordinate 
systems is usually significant. 

Asymptotic Behavior. The differential scattering cross section 
<r(9,<t>) in the center-of-mass coordinate system can be found from the 
asymptotic form of the solution of the second of Eqs. (16.5), 
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which is the wave equation for the relative motion. The wave function 
w may be written as a function of the angles e,4> of Fig. 16(6) and the radial 
distance r between the two particles. From Eq. (16.4) the reduced ttiubh 
is M + m 2 ). The energy E associated with the relative 

motion is easily seen from Fig. 16 to be 


E = 


mi + ma 


(18.9) 


where Eq is the initial energy of the bombarding particle. It is interesting 
to note that E is equal to the kinetic energy of a particle whose mass is 
the reduced mass /x and whose speed is the relative speed v. Thus we can 
think of Eq. (18.8) as representing the collision of a particle of mass /x, 
initial speed and kinetic energy E = Avith a fixed scattering center 
that is described by the potential energy V (r) ; then r is the vector distance 
from the fictitious particle ju to the origin of the scattering potential. 

As in Sec. 17, the scattering is determined by the asymptotic form of 
u(rjej<t}) in the region where F = 0. When the colliding particles are 
far apart, we Avant u to contain a part that represents an incident particle 
of mass fi moving in a particular direction with speed v, and a part that 
represents a radially outgoing particle: 


u{r,e,<l,) > .1 + »-'/(0,,^)«‘*1, A- = -^ (18.10) 

The first term in h]q. (18.10) represents a particle moving in the positive 
z direction, or along the polar axis 0 = 0, since z = r cos 0; it is an infinite 
plane wave of the form of the momentum eigenfunction (11.2), where the 
propagation vector k has the magnitude k and is directed along the polar 
axis. The second term in Eq. (18.10) represents a particle that is moving 
radially outward; its amplitude depends on 0 and and is inversely 
proportional to r since the radial flux must fall olT as the inverse square 
of the distance. It is readily verified that Eq. (18.10) satisfies the wave 
equation (18.8) asympl.otically throiigli terms of order l/r in the region 
in which F = 0, for any form of the funct-ion f{0,<l>). 

Normalization. The physical meaning of the coefficient A and the 
angular function / can be inhu’red from a cahmlation of the particle flux, 
as in Sec. 17. A straightforward sul)stitution of E(|, (18.10) into Eq. 
(7.3), however, yields interfcMHun^e terms hetAveen the incident and 
scattered Avaves that do not appear in most expoi’imental arrangements; 
that they do not api)ear can be s(U)n in the following way. 

In practice, the incident and s('attered particles are separated from 
each other by collimating on(^ or the other. Suppose, for example, that 
the experimental arrangement is as shoAvn schematically in Fig. 17, so 



102 


QUANTUM MECHANICS 


[Chap. V 


that the bombarding particles from the source S are collimated by dia- 
phragms DD into a fairly well-defined beam. Such a collimated beam 
is not an infinite plane wave of the form but can be made up 
by superposing infinite plane waves that have propagation vectors of 
slightly different magnitudes and directions. The total angular spread 
in radians will be of the order of the ratio of the wave length of the particle 
to the diameter of the collimating aperture, and so can be made extremely 
small in a practical case. Now /usually does not vaiy rapidly with angle, 
so that the small directional spread of the incident propagation vectors 
does not affect / significantly. Thus at the point of observation, P, only 
the /term is present, and it is essentially the same as that Avhich appears 
in Eq. (18.10). The incident flux can be calculated from the plane wave 
term alone, since if we go far enough from the scattering region, the / 



Fig. 17. Sohematic diagram of a laboratory arrangement for tlie ineaKurement of aoatter- 
ing, in which there is no interference between the incident and acatterod waves at tlie point 
of observation P. 

term can be made negligible. Thus in the region of observation, the 
interference terms are a consequence of the idealization implicit in assum- 
ing an infinite plane wave in Eq. (18.10), and usually have no physical 
significance.^ 

Substitution of the two terms of Eq. (18.10) separately into Va\. (7.3) 
shows that the incident flux is of magnitude v\A[-^ along the polar axis, 
and that the leading term in the scattered flux is of magnitude 

v\Ame,.i>)\yr^ 

along the outward radius. From the definition of cross section, it follows 
that 

= |/(0,<^>)|‘^ (18.11) 

As discussed in Sec. 17, the choice of the coefficient A is unimportant 
so far as the calculation of the scattering is concerned. The wave fun(^- 
tion may be normalized to unit incident flux l)y choosing A = 1/yi, or it 
may be normalized by making S\u\^dT = 1 over a large box that has 

^ For a somewhat exceptional case, see the diseus.sion of Kq. (10.24') in tlu^ iie.xt 
section. 
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periodic boundary conditions. We shall often simply set A equal to 
unity. 

19. SCATTERING BY SPHERICALLY SYMMETRIC POTENTIALS 

The asymptotic behavior of the wave function determines the dif- 
ferential scattering cross section, but cannot itself be found without solv- 
ing the wave equation (18.8) throughout all space. As was the case with 
the calculation of energy levels considered in Chap. IV, this can be done 
only when the wave ecpiation is separable, and a particular case of great 
physical interest is that in which the potential energy is spherically 
symmetric. We assume here that 7 is a function only of r, and find the 
connection between the solutions separated in spherical polar coordinates 
and the asymptotic form (18.10); this procedure is called the method of 
'partial waves. 

In the remainder of this chapter we shall not, for the most part, 
distinguish between collisions of a particle with a fixed scattering center, 
and collisions between two particles treated in the center-of-mass coordi- 
nate system. 

Asymptotic Behavior. It is apparent that the problem now possesses 
S 3 rmmetry about the polar axis, so that Uy /, and a- are independent of the 
angle The general solution of Ecp (18.8) then has the form (see Sec. 
H) 

□0 00 

u{r,d) = y Ri{r)Pi{coH 6) = Vr-ixiCO^^zCcos 0) (19.1) 


where Pi is the Legendre polynomial of order Z, and xi satisfies the equation 



- m - 


i{i + 1 ) 


(/(r) = 


2gF(r) 

A2 


Xi = 0 
^ 0 


(19.2) 


The boundary condiUon af r = 0 ibat Ri be finite or xi vanish determines 
the asymptotic; form of tiie solution (19.2) except for an arbitrary multi- 
plicative constant. 

In order to find the general nature of this asymptotic behavior, we 
considei- r to be so large that the II and I terms in (19.2) can be 
neglected. Then the solution of lOq. (19.2) is one of the forms To 

get a better approximation, we put 

Xi(r) = v(r)e^^^ (19.3) 


where v is assumed to be a slowly varying function of r for large r. Sub- 
stitution of (19.3) into (19.2) gives 
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V" ± 2ikv’ = ^U(x) + V (19.4) 

where primes denote differentiation with respect to r. If now we neglect 
v" in comparison with Jcv\ since v is supposed to be slowly varying, Eq. 
(19.4) can be integrated to give 

±2iklnv - j j^J7(r) + j + constant (19.5) 

Thus if U(r) falls off more rapidly than 1/r for large r, the integral on the 
right side of (19.5) converges and v approaches a constant as r becomes 
infinite. This justifies the assumption made above that v is slowly 
varying. The Coulomb field, for which U is proportional to 1/r, thus 
requires special attention, and will be discussed in Sec. 20. 

The asymptotic form of xz(?") can then be written quite generally 

Xi(r) > A'l sin (kr + (19.6) 

+ 00 

where thus far A'l and 5^ can be complex. The solution of (19.2) that 
vanishes at r = 0 is unique except for a multiplying constant. It can 
be shown that this solution is real everywhere if it starts out to be real at 
r = 0, since k, U, and I are all real. Thus must be real, although A'l 
need not be. This being the case, it is readily verified that the total 
radial flux of particles through a large sphere vanishes; 

lim 2jrr^ Sr sin dd9 = 0 

where Sr is the radial component of the vector (7.3) calculated by sub- 
stituting u{r,6) from Eq. (19.1) into it. This means that there are no 
sources or sinks of particles present, and the particles that are scattered 
radially outward are supplied by the incident plane wave. 

Scattering Cross Section. It is convenient to redefine the amplitude 
Ai and phase angle that appear in Eq. (19.6) in terms of a somewhat 
more specialized problem. It will be assumed that U(r) can be neglected 
for r greater than some distance a; in cases of practical interest, a may be 
small enough so that the I term in (19.2) is not negligible. For r > 
the most general form for Ri{r) that is real (except possibly for a complex 
multiplying constant) is shown in Sec. 15 to be 

Riif) = A 2 [cos hiiO^r) — sin 5znz(A*r)] (19.7) 

where is real; according to (15.8), this has the asymptotic form 
Ri{t) ^ {kr)-'^Ai sin {kr — + bi) 

J ► «0 

Equations (19.6) and (19.8) agree if Ai = ItA'i and 6z = 6j + ^lir. 


(19.8) 
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We now wish to identify the asymptotic form of (19.1) with (18.10). 
To do this, we require an expansion of e"** = in Legendre poly- 

nomials:^ 

00 

gtArcoHfl = y (2Z + l)i^ji(kr)Pi{cos 9) (19,9) 

z-o 

Substituting the asymptotic form of (19.9) into (18.10) with A = 1, 
and equating this to the asymptotic form of (19.1), we obtain 


^ (21 + l)z«(/fr)-i sin (kr - ■|Zx)Pi(co8 6) ,+ r-V(0)e®' 


00 



Ai{kr)~'^ sin {kr — -J-Ztt + 6i)Pi{qob 6) 


When the sine functions are written in complex exponential form, the 
coefficients of e’*'' and of on the two sides of this equation must be 
equal to each other: 


00 oo 

2ikf(d) + y (2Z + l)i*e-»"’'Pj(cos(9) = V 9) 

z «o z »0 

oo 00 

y {21 + l)i!^ei»*^Pz(cos ” y i^'^^Pi(cos 6 ) 

1=0 z=o 


(19.10) 


Since these are true for all values of 6 and the J^egendre polynomials are 
orthogonal to eacli other, the second of Ec^s. (19.10) becomes 


Ai = {21 + l)i^e'»' 

Substitution of this into the first of Eqs. (19.10) gives for the scattering 
amplitude 

00 

f{e) = (2^A•)-' y {21 + l)(c2«' ~ l)Pz(cos 6) (19.11) 

z=o 

Thus the dilTerential cross sect.ion is 


^(0) = |/(fl)|. = 1 


(21 + !)(:“' sin 8iPi(coH 6) 


(19.12) 


^G. N. Watson, “Th(sory of Ucssol Fiitu^tioiis,” rnv. od., p. 128 (Macmillan, Now 
York, 1944). 
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Because of the orthogonality of the Legendre polynomials, the total cross 
section contains no products of factors involving different values of l\ 

oo 

cr = 2 t (r(6) sin ~ ^ ^ (2^ + 1) (19.13) 

Phase Shifts. The angle 5? is called the phase shift of the Zth partial 
wave, since according to (19.8) it is the difference in phase between the 
asymptotic forms of the actual radial function Ri(r)j and the radial func- 
tion ji(kr) in the absence of scattering potential (f7 = 0). The phase 
shifts completely determine the scattering, and the scattering cross section 
vanishes when each of the di is 0° or 180°. 

It should be noted that the derivation of (19.11) is valid whether or not 
there exists the assumed radius a beyond which U{r) is negligible, pro- 
vided that U(r) falls off more rapidly than 1/r. However, the method 
of partial waves is most useful for computing scattering cross sections if 
such a radius a does exist, especially if ka is of the order of or less than 
unity. The reason for this is that the first and largest maximum of 
ji{kr) lies roughly at r = l/k, and that for r much smaller than this, ji is 
small and increases about as [see Eq. (15.7)]. Thus if a <3C Z/A*, ji will 
be very small where U is appreciable; then the Zth partial wave will 
hardly be affected by the potential, the phase shift Si will be very small, 
and the contribution to the scattering from that I will l)e negligible. It 
follows then that the scattering cross section consists of a sei'ies of terms 
extending from Z = 0 to a maximum Z that is of the order of ka. Since 
the computation of the phase shifts is usually a tedious affair, the smaller 
the magnitude of ka the easier the method is to apply. Thus this method 
of partial waves is most useful at low bombarding energies. 

It is interesting to note that the classical distance of closest approach 
of a free particle of mass p, velocity v, and angular momentum Ih to the 
origin is Ih/fxv = l/k. Thus the foregoing remai’ks are analogous to the 
statement that a classical particle is not scattei'ed if it has sufficient 
angular momentum so that it does not enter the potential region 7* < a. 

Calculation of 8i. The phase shift 8i is computed by fitting the radial 
wave function Ri(r) for r < a, which may have an analytics foi'm and can 
always be found numerically if necessary, to the exterior solution (19.7). 
The boundary condition at r = a is that (\/Ri){dRi/(lr) he continuous. 
Thus if 7 z is the ratio of slope to value of the interior Avave function, we 
have that 

Z<^[iz(A'u) cos 8i — n[(ka) sin _ 
ji{ka) cos Si — ni(ka) sin 8i 
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where the derivatives and nj may be rewritten with the help of (15.10). 
Then 5? is given by 


tan 5z = 


kni(ka) — yini{ka) 


(19,14) 


Equation (19.14) can be used at once to obtain an approximate expres- 
sion for 8i when I » ka and 8i is expected to be small. In this case, ji 
will differ little from the ratio of slope to value of the solution in the 
absence of a scattering potential, so that we put 


7z 


= k\i 

iJi 


ji{ka) 


+ ez 


« 


flika) 


jiika) 


Equation (19.14) can be rewritten with the help of (15.9) as 

€i(kayjf(ka) 


tan 8i = 


ei(kayji(ka)ni{ka) — 1 


(19.15) 


(19.16) 


which is still exact. If now we make use of the power series expansion 
for jz from (15.7) when (ka)'\ and use (15.7) and (15.8) to estimate 
the order of magnitude of //z, the inecpiality in (19.15) becomes 




(19.17) 


and (19.16) may be approximated as 


^ 

[1 . 3 • 5 . . . {2L + 1)]‘-^ 


6z22^(Z!)2(A-a)2«+2 

m + 1)!]‘^ 


(19.18) 


Equation (19.18) can be used to verify the convergence of the sums 
over partial waves such as appear in (19.11). We use Stirling's formula 
to find the heading terms in In \8i\ when I is large, and neglect terms of 
order In Z and lower. 

In \8i\ ^ In \ei\ + 2Z[ln (ka) + 1 — In 2] — 21 In I (19.19) 

Thus even if |ez| has the maximum value indicated by (19.17), h falls off 
like the inverse; fac.ioi’ial of I (faster than exponentially), and the series 
that appear in the expressions for the scattering converge quite rapidly 
for large Z. 

Relation between Signs of 5z and V{r). It is apparent from (19.18) 
that when I » (A*a)-, di has the o|)positc sign from ez. If now the potential 
energy term V or U is positive, corresponding to forces that are mainly 
repulsive, Eq. (19.2) sliows that the ratio of curvature to value for the 
radial wave function is more positive than in the foi-ce-free case. This 
means that the ratio of slope to value is more positive at r = a than is 
the case if U = 0. Thus a repulsive potential makes ei positive and 8i 
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negative. A negative phase shift means that the radial wave function is 
^'pushed out’’ in comparison with the force-free wave function. 

In similar fashion, we see that a negative potential makes ei negative 
and 8i positive. This means that the radial wave function is pulled 
in” by the attractive potential. 




Fig. 18. Schematic plots of the effects of (a) positive (repulsive) potential, and (6) nega- 
tive (attractive) potential, on the force-free radial wave function ji{kr)\ the range of the 
potential is a in each case. Ri(r) is drawn arbitrarily to start out like ji{kr) at r — 0, and 
is bent up more rapidly in (a) so that it has a greater amplitude and a retarded phase 
(pushed out) with respect to ji(,kr). In (6), Ri{r) bends over sooner, and thus has a smaller 
amplitude than ji(,kr) and an advanced pha^e (pulled in). The amplitudes have no direct 
physical significance, whereas the phases determine the scattering. The difference between 
neighboring nodes of ji and Ri is not precisely equal to the phase shift divided by k (as 
indicated) until jz has gone through several oscillations and attained its asymptotic form. 

These conclusions are valid even when I is not large compared to ka 
and di is not small. This may be seen graphically by comparing ji{kr) 
and Ri(r) when they are arbitrarily made to start out in the same way at 
r = 0. Figure 18(a) shows a schematic comparison for positive F, and 
Fig. 18(6) for negative F. 

Ramsauer-Townsend Effect The construction in Fig. 18(6) suggests 
that an attractive potential might be strong enough so that one of the 
radial partial waves is pulled in by just half a cycle and its phase shift is 
180°. If this were the case, the corresponding term in the expression 
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(19.11) for f{d) would vanish, and there would be no contribution to the 
scattering. It is clear from the foregoing discussion that the phase shift 
is largest for I = 0. The possibility then arises that ka can be small 
enough and the attractive potential large enough in magnitude so that 
5o = 180® and all other phase shifts are negligibly small. In such a case, 
the scattered amplitude f(d) vanishes for all 6, and there is no scattering. 

This is the explanation^ of the Ramsauer-Townsend effect, the ex- 
tremely low minimum observed in the scattering cross section of electrons 
by rare-gas atoms at about 0.7 electron-volt bombarding energy.^ A 
rare-gas atom, which consists entirely of closed shells, is relatively small, 



Fig. 19. Schematic plot of the effect of the potential of a rare-gas atom of “radius” a on the 
Z = 0 partial wave of an incident electron that has the minimum cross section observed 
in the Ramsauer-Townwcnd effect. As in Fig. 18, the actual and force-free wave functions 
start out in the same way at r = 0; the former is “pulled in” by 180° of phase. In an 
actual case, the quantity ka would be somewhat smaller than is indicated here. 

and the combined force of nucleus and atomic electrons exerted on an 
incident electron is strong and sharply defined as to range. Thus it is 
reasonable to expect that a situation such as that illustrated in Fig. 19 
could occur. Here the partial wave with 1 = 0 has exactly a half cycle 
more of oscillation inside the atomic potential than the corresponding 
force-free wave, and the wave length of the electron is large enough in 
comparison with. a so that higher I phase shifts arc negligible. It is clear 
that this minimum cross section will occur at a definite energy, since the 
shape of the wave function inside the potential is insensitive to the rela- 
tively small bombarding energy whereas the phase of the force-free wave 
function depends rapidly on it. 

Physically, the Ramsauer-Townsend effect may be thought of as a 
diffraction of the electron around the rare-gas atom, in which the wave 
function inside the atom is distorted in just such a way that it fits on 

1 This explanation, sup;gostcd by N. Bohr, was shown to be quantitatively reason- 
able by H. Fiixdn and J. Holtsmark, Zeits.f. Physik, 46, 307 (1927). 

® The experiinontal results arc suinninrizcd by R. Kolhith, Phys. Zrits., 31, 985 
(1931). 
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smoothly to an undistorted wave function outside. This effect is anal- 
ogous to the perfect transmission found at particular energies in the one- 
dimensional problem considered earlier [see discussion of Eq. (17.6)]. 
Unlike the situation in one dimension, however, the Ramsauer-Townsend 
effect cannot occur with a repulsive potential, since ka would have to be 
at least of order unity to make 5o = — 180°, and a potential of this large 
range would produce higher I phase shifts. 

Scattering by a Perfectly Rigid Sphere. As a first example of the 
method of partial waves, we compute the scattering by a perfectly rigid 
sphere, which is represented by the potential F(r) = for r < a, 
and V(r) =0 for r > a. The solution for r > a is just Eq. (19.7). The 
boundary condition, obtained in Sec. 8, that u(a,d) = 0, is equivalent to 
the requirement that all the radial functions vanish at r = a. The phase 
shifts may then be obtained by setting either Ri{a) given by (19.7) equal 
to zero, or yi in (19.14) equal to infinity: 

tan Si (19.20) 

ni{ka) 

The calculation of the scattering is particularly simple in the low- 
energy limit: ka = 27ra/X <3C 1. Then substitution of (15.7) into (19.20) 
gives as an approximation for the phase shifts 

tan Si^- + i)[i . 1 .^3 -V • ■ (21 - 1)]'^ (19.21) 

Thus 8i falls off very rapidly as Z increases, in agreement with (19.19). 
All the phase shifts vanish as A* 0; however, the I = 0 partial wave 
gives a finite contribution to the scattering because of the factor l/A-‘^ that 
appears in (19.12) and (19.13). We thus obtain 

<T{e) ^ a2, 0- ^ 47ra2 (19.22) 

The scattering is spherically symmetrical, and the total cross section is 
four times the classical value. 

In the high-energy limit (A*a 1), we might expect to get the classical 
result, since it is then possible to make wave packets that are small in 
comparison with the size of the scattering region, and these can follow 
the classical trajectoiies without spreading appreciably. This corre- 
sponds to the ray limit in the wave theory of light or sound. The dif- 
ferential scattering cross section is rather difficult to find, and we only 
indicate the computation of the leading term in the total cross section. 
Substitution of (19.20) into (19.13) gives 
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Itt Y (2; + l)jl(A-a) 

Z/ + nf(A-a) 

z=o 


(19.23) 


We can make use of asymptotic expansions of Bessel functions that are 
valid when the argument is large and the order is smaller than, of the 
order of, and larger than the argument.^ The calculation shows that 
most of the contribution to the sum in (19.23) comes from 


I < {ka) - (7(/ra)i, 

where C is a number of order unity; the leading term here is ^{kay. 
The other two parts of the sum, for (ka) — C(ka)^ < I < (ka) + C{ka)^, 
and for I > (ka) + C{ka)^, each contribute terms of order (ka)^, and hence 
may be neglected in the high-energy limit. Thus 

(T ^ 27ra2 (19.24) 


which is twice the classical value. 

The reason for the apparently anomalous result (19.24) is that the 
asymptotic form of the wave function is so set up in Eq. (18.10) that in 
the classical limit the scattering is counted twice: once in the true scatter- 
ing (which turns out to be spherically symmetric as it is in the classical 
problem), and again in the shadow of the scattering sphere that appears 
in the forward direction, since this shadow is produced by interference 
between the incident plane wave e'*® and the scattered Avave 
However, so long as ka is finite, diffraction around the sphere in the for- 
ward direction actually takes place, and the total measured cross section 
(if the measurement can be made so that it includes the strong forward 
maximum) is approximately 27ra2. 

Scattering by a Square Well Potential. As a second example of the 
method of partial waves, we consider the somewhat more complicated 
problem of the scattering from the spherically symmetric square well 
potential illustrated in Fig. 13 of Sec. 15. The interior (r < a) wave 
function that is finite at r = 0 is seen by analogy with Eq. (15.11) to be 


Riir) = a = 25) 

Thus the phase sliii'ts ai’e j^iven by Eq. (19.14), where the ratio of slope 
to value of the Zth partial wave at r = a is 


_ oifiiaa) 


(19.26) 


^ Watson, op. rit., Chap. VIII. 
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In the low-energy limit (ka <SC 1), substitution of (15.7) into (19.14) 
gives for the first two phase shifts 


tan 5o = — 


yoka^ 

1 + yod 


tan 5i 


^ (A-a)^ 1 - Tia 
3 2 + yia 


(19.27) 


Unless yoa = — 1 or 71a = —2, both of these vanish as A* — > 0. As with 
the rigid sphere, however, the I = 0 partial wave gives a finite contribu- 
tion to the scattering because of the factor l/k‘^ that appears in (19.12) 
and (19.13). From Eq. (19.26), we see that yoa = aa cot aa — 1, so that 

cr ^ W ^1 - (19-28) 

The scattering is spherically symmetrical. 

The conclusion reached here and in connection with the rigid sphere 
that the low-energy scattering is substantially independent of bombard- 
ing energy and angle of observation is almost always valid for any 
potential that has a finite range. Exceptions can arise, as pointed out 
after Eq. (19.27), if any one of the yi is such that the denominator of 
the expression for tan bi is very small. In such a situation, the Zth partial 
wave is said to be in resonance with the scattering potential; then it usu- 
ally dominates the scattering. 

Resonance Scattering, An approximate expression for the resonance 
cross section can be obtained by making use of the fact that yi decreases 
linearly with increasing a when a is sufficiently close to ao = (2/x7o/ft^)** 
Increasing a causes the interior wave function to bend over more rapidly 
and so decreases the ratio of slope to value at r = a. Now 


a 


(«g + ^ ao + 


when k is small, so that we can write to lowest order in k 

yia = 7?a — bi(kay 


where 7? is the value of yi when a = ao, and bi is a positive number of 
order unity.^ Substitution into (19.27) and then into (19.12) gives for 
the leading term in the differential cross section, in the two cases for 
which the value of I for the partial wave that is in resonance is 0 and is 1, 


^ It can be shown that the Zth partial wave is exactly in resonance at zero bombard- 
ing energy when 7 fa = — (Z 4- 1) ; in this case 6z = § for all 1. Compare with footnote 
1, page 80. 
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(fo - + {kay 

9a^ cos^ d {kaY 

(fi - 6iW)2 + ikaY' 


I = 0 (19.29) 
I = 1 (19.30) 


We have put fo = yla + 1 and fi = y\a + 2; for resonance, |fol and |f i| 
are small compared to unity. It is easy to show that (19.29) is a mono- 
tonically decreasing function of ka) however (19.30) has a maximum at 
ka = (f i/6i)^ if f 1 is positive, and decreases monotonically with ka if 
f 1 is negative. 

If we make use of the relation fo = oioa cot aoa, we see from (19.29) 
that a suitable approximation for the total cross section when the 1 = 0 
partial wave is in resonance is 


c 


4t 


/r“ + al cot^ aoa 


(19.31) 


It is apparent that the I = 0 partial wave is in resonance at low bombard- 
ing energies whenever aoa is approximately an odd multiple of 7r/2, so 
that Voa^ ^ wW/SfjL, OrV/Sn, etc. The discussion of Eq. (15.3) shows 
that these are just the values of for which new energy levels with 
I = 0 appear. It is true quite generally that a potential well (not neces- 
sarily square) that has an energy level nearly at zero exhibits a resonance 
in the low-energy scattering of particles with the same I value (not neces- 
sarily zero) as the energy level. From a physical point of view, we can 
say that an incident particle that has nearly the right energy to be bound 
by the potential tends to concentrate there and produce a large distortion 
in the wave function and hence a large amount of scattering. 

Resonance maxima in the low-energy scattering like that found above 
for Z = I (;an appeal* for any Z value except Z = 0, provided that the 
potential well is not ({uite dee]) or broad enough to contain a new energy 
level of that angular momentum (this corresponds in the case of the square 
well to having small and positive). We can think of such a potential 
physically as containing a virtual energy level slightly above zero. While 
a discrete enei*gy Uw'el c.annot exist with positive energy, the positive 
'‘centrifugal poteni-iar’ /(Z + [see the discussion of Eq. (14.18)] 

for Z > 0 a(4.s as a potent, ial barrier that impedes the escape of a particle 
that IS in I, he virtual encn'gy level. Figure 20 illustrates this barrier, 
which is charac-t(n*ized by a small transmission at low energies in the same 
way as is the bari-ier of Fig. 14 [see the discussion of Eq. (17.7)]. Thus 
the virtual level has a kind of transient existenc,e and produces a greater 
distortion of the incident wave function at its energy than at neighboring 
energies. 
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Angular Distribution at Low Energies, When the bombarding energy 
is small but not zero, the partial wave I = 1 may have an observable effect 
on the scattering. If only 5o and 8i are appreciably different from zero, 
Eqs. (19.12) and (19.13) become 

(y (6) = Y 2 So + 6 sin 8q sin 6i cos {8i — 8o) cos d 

+ 9 sin^ 5i cos^ 6] (19.32) 

O' = ^ (sin^ 3o + 3 sin^ 8i) 

In the absence of resonance, we see from Eqs. (19.27) and (19.32) that 
the ratio of the contributions to the total cross section of the partial waves 
I = 1 and Z = 0 is of order {kay. However, the ratio of the largest angle- 



Fig. 20. Effective potential energy lV(r) plus “centrifugal potential”] for Z > 0, when 
V = 0 for r > a. The dashed portion for r < o depends on tlie shape of V, The effective 
potential barrier (r > a) resembles the barrier of Fig. 14 in that it has a small trans- 
mission for E slightly greater than zero,, 

dependent term in the differential cross section (which is proportional to 
cos Q) to the constant term is of order {kaY. 

Thus the partial wave Z = 1 manifests itself in the angular distribution 
at a lower energy than that at which it becomes significant in the total 
cross section; this is because of its interference with the stronger partial 
wave Z = 0. For example, if 5o = 20° and 5i = 2° at a particular bom- 
barding energy, the partial wave Z = 1 contributes only 3 per cent to the 
total cross section Avhile it makes the forward scattering {d = 0°) 3.5 
times as great as the backward scattering {d = 180°). 

20. SCATTERING- BY A COULOMB FIELD 
It was noted in Sec. 19 that the Coulomb field is an exceptional scat- 
terer so far as the application of the method of partial weaves is concerned. 
If V{r) = for a collision between particles of charges Tte and 

Z'c, it is easily seen that Eq. (19.3) becomes asymptotically 

Xz(^) ^ In r) 


( 20 . 1 ) 
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Here n = ixZZ'e-lh^k = ZZ'e^ltiv^ where v is the relative velocity and ii is 
the reduced mass. Thus the radial solutions never approach the sinus- 
oidal free-particle solutions, since there is always a logarithmic contribu- 
tion to the phase at great distances that cannot be neglected. Although 
it is still possible to obtain a solution of this scattering problem in spheri- 
cal coordinates (this is given below), the phase shifts hi introduced in 
Sec. 19 are altered in meaning. This section presents the analytical 
work in outline, and quotes the principal results from more extended 
treatments.^ 

Parabolic Coordinates. So long as <r(0) for a pure Coulomb field is all 
that is desired, it is simpler to work with the separation of the wave 
equation in parabolic coordinates (see Sec. 16) than in spherical coordi- 
nates. The reason for this is that the desired solution depends almost 
entirely on the variable ^ defined in Eq. (16.25), and not on the other two 
variables rj and <l>. It is apparent that the solution will not involve <!>, 
because of the axial symmetry of the problem; if now the incident plane 
wave term e'*® is taken out as a factor, it can be made plausible that the 
rest of the solution does not involve r?. We put 

Uc = c^f ( 20 . 2 ) 

where Ue represents the complete Coulomb wave function (incident plus 
scattered wave). Now iio must contain a part whose dominant asymp- 
totic behavior is of the form hiii no part that goes like 

[see Eq. (18.10)]. Since an expression — z) can be of this form, 

while an expression c‘*^/(r + z) cannot, we anticipate that the function / 
appearing in (20.2) will depend only on ^ = r — 

We substitute Eq. (20.2) into Eep (16.26) after replacing Z by —ZZ', 
and remember that E > 0. The dilTerential eciiiation for / is then 

I ^ + (1 - iki) ^ - nkj = 0 (20.3) 

The confluent hypergoometr-ie ecuiation 
d-F dF 

+ = 0 (20.4) 

which has the solution F{a,h,z)j is equivalent to Eq. (20.3) if we put 

m) = CF{-m,l,ik0 (20.5) 

where (7 is a constant. 


1 W. Gordon, Zeih. /. Phy.vh, 48, ISO (1028); N. F. Mott and H. S. W. Massey, 
“The Theory of Atomic (\)llision.s,'’ ('hap. Ill (Oxford, New York, 1933). For the 
mathematical l)M,(':kgroun(l, s(^(^ T. Whitiaken’ and G. N. V/atson, “A Course of 
Modern Analysis,” -Ith (^d., Chap. XVI (Cambridge, London, 1935). 
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Confluent Hypergeometric Function. The solution of Eq. (20.4) that 
IS regular at 2 = 0 can be written as a power series 


F(fl,h,z) = V r(a + s)V{h)^ 

^ 4 f(^)r(6 + a)r(i +7) 

. fi «0 


1 + 


az 

m 


I a(a + 1)2^ , 
bib + 1)2! ■ 


( 20 . 6 ) 


It is convenient to put F(a,b,z) = W^(a,b,z) + W,ia,b,z), where TTi and 
W, are separate y solutions of Eq. (20.4). An asymptotic expansion for 
r can then be obtained from the following relations: 


= f ^-a) a - 6 + 1, - 2 ) 

Wi{a,b,z) = e' 2 “-»p(l - a,b - a,z) (20.7) 

giccM >l+^, + ^ + 1 ) . 

2->» 2l! 22 2! ^ 


The solution of Eq. (20.4) that is irregular at the origin can be taken to be 

6ia,b,z) = iWiia,b,z) — iWi{a,b,z) (20.8) 

We shall require the irregular solution for problems in which the Coulomb 
held does not extend in to r = 0. 

Tim asymptotic form of the Coulomb wave function can be obtained 
rom Eqs. (20.2), (20.5), and (20.7). The result through terms of order 

0"“* 1 TCI 


IS 


Uc >• r-r jeilAs-n'nfcCr-*)] Tj 1 

r(l +211) ( I ikir-z)\ 

+ /-■/„(0)e»(*”-»‘" 2*>-) I (20.9) 

where ' 

i (e'\ = ^(1 + in) 

fr(— in) 2k sin''® 

° (20.10) 
Vo = arg r(l + in) 

■A Section and Normalization. The/<, term on the right 

side of Eq. (20.9) represents the outgoing scattered wave, since it is the 

appears. The first term in Eq. 
(20.9)_ similarly corresponds to the incident “plane” wave; the multi- 
p icative actor —n^/ik(r 2 ) can be ignored in the asymptotic region. 



Sec. 20] CONTINUOUS EIGENVALUES: COLLISION THEORY 


117 


Both the incident and scattered waves are diverted at infinite distances 
by logarithmic phase factors. In accordance with Eq. (18.11), the dif- 
ferential scattering cross section is 

- '/-wi’ - (w£^J 

“ (^ 1 ?) ( 20 . 11 ) 

This is just the formula obtained by Rutherford from classical dynamics, 
and verified experimentally by him for the collisions of alpha particles 
(helium nuclei) with heavier nuclei. It should be noted, however, that 
the angle-dependent part of the phase factor in the scattered amplitude 
fc(0) given in Eq. (20.10) can manifest itself in a nonclassical way when 
the colliding particles are identical (see Sec. 32). 

If the incident beam is normalized to unit flux, the constant C must 
be chosen to be 

C =: 2.- Jr(l + 


so that the Coulomb wave function is 

Uc = 

= ir ir(l + in)e- sin^ ^d) (20.12) 

Then the particle density at r = 0 is found from the power series expan- 
sion (20.6) to be 


|wc(0)|- = |C1“ = ?'“^|r(l + 
_ 27nr 

“ ;r(^7‘^n^^rT) 


For small collision speeds {\n\ » 1), Ecj. (20.13) tells us that 

|//r(0)l“ ^ attractive case, v. < 0 

|//f(0)|“ = c repulsive (\‘ise, n > 0 


(20.13) 


(20.14) 


The second of Eqs. (20.14) is of some practical interest. The expo- 
nential is the dominant factor in the production of reactions between 
positively charged nuclei at low bombarding energies, when the nuclear 
radii may be assumed small enough so that the colliding nuclei have to 
approach to zero distance in order to initiate a reaction. In this case 
exp { — 27rZZ'e-lhv) is called the Gamow factor and is the dominant term 
in the rate of many nuclear reactions at low bombarding energies. 

^ G. Gamow, Zeits. /. Physik, 51, 204 (1928); R. W. Gurney and E. XJ. Condon, 
Fhys. Rev., 33, 127 (1929). 
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Solution in Spherical Coordinates. In nuclear collision problems, 
such as that of the scattering of protons of several million electron-volts 
energy in hydrogen, the departures from the Coulomb law of interaction 
at small distances between the colliding particles can affect the scattering- 
cross section. Such problems can be treated by a modification of the 
method of partial waves, developed in Sec. 19, in which an expansion in 
spherical harmonics is made for the pure Coulomb field, and modifica- 
tions introduced for the first few I values. In order to apply such a 
technique, we require first a solution for the pure Coulomb scattering in 
spherical partial waves. 

W^e put 

oo 

= y Ri(r)Pi(cos 6) (20.15) 

1 = 0 

where the radial wave equation is 

1 1 / , ^z\ ^ ^ m + 1) 

dr \ dr J [_ ^ 

If we substitute Ri(r) -= r^e^^^fi(r), the equation for fi becomes 

r ^ + [2ikr + 2(1 + 1)] ^ + [2ik(l + 1) - 2nk]fi = 0 (20.17) 

This is equivalent to the confluent hypergeometric equation (20.4), and 
has as its solution that is regular at r = 0 

fi{r) = CiF{l + 1 + in, 21 + 2, -2ikr) (20.18) 

The asymptotic form of (20.18) can be found from (20.7), and gives for 
the radial wave function at great distances 

(2fryr(r+f+in^ Fr ^ 

where rjt = arg r(Z + 1 + in). 

The coefficients Ci must be determined so that the partial wave expan- 
sion (20.15) is identical with the solution (20.12) in parabolic coordinates. 
Because of the orthogonality of the Legendre polynomials, we have the 
relation 

Ri(r) = Piicos e)uc(r,e) sin BdO (20.20) 

where Uc(r,6) is given by the second of Eqs. (20.12). The complete 
evaluation of this integral can be avoided by making use of the fact that 
we know all about the function Ri{r) except the constant multiplying 


= 0 ( 20 . 16 ) 
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factor Cl. Cl can then be found by matching the known form of Ri{r) 
to Eq. (20.20) near r = 0, and turns out to be 

^ + 1 + in) 

vi(2l)\ 

We thus obtain as an alternative expression to (20.12) 

00 

== y m +^1 + in) 

1^0 

•F{1 + 1 + in, 21 + 2, -2ikr)Pi(cos 0). (20.21) 

Modified Coulomb Field. If the actual potential deviates from the 
Coulomb form only at small values of r, we expect in analogy with the 
partial wave treatment of section 19 that only the first few terms in 
the sum (20.21) will be altered. Since each partial radial wave function 
must be a solution of (20.16) outside of the potential anomaly, the only 
change we can make in the function/? and still have it a solution of (20.17) 
is to add in some of the irregular solution G{1 + 1 + in, 21 +- 2, — 2zlr) 
defined by (20.8). The manner in which G is to be added in is determined 
by the requirement that the complete wave function shall represent 
asymptotically a Coulomb incident plus scattered wave, plus an extra 
outgoing scattered wave. 

We must, therefore, substitute for each F term in (20.21) a linear com- 
bination of F and G in which the amount of the ingoing term is not 
changed. Such a combination is 

e'^i[F cos hi + G sin hi) = W + 1^2. 

The modified wave function, which is a solution of the wave efiuation 
outside of the potential anomaly, can then be written 

oo 

(2*r)‘r»^ 

/T() 

• - \)Wi{l +■ 1 + in, 21 + 2, -2ikr)Pi[v.o^ d) (20.22) 

The asymptotic foi’in of is 

Un, > V i / (2/ + l)/yC'^Vl-S/>(/iT)-l 

r-> M 

/-() 

■ sin {kr ~ J/tt — 7i In 2/(T + + 5z)P?(cos B) (20.23) 

As shown in connection with lOcp (19.0), each term on the right side of 
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(20.23) must be a real function of r, except for complex multiplying fac- 
tors, so that the 5i must be real. 

The additional phase shifts di can be found by matching each partial 
radial wave in Eq. (20.22) to the interior solution at the edge of the 
potential anomaly, in just the same way as the phase shifts were found 
in Sec. 19. While in Sec. 19 the phase shifts represented the departure 
of the wave function from that of a free particle, they here represent 
the departure from the wave function of a particle scattered by a pure 
Coulomb field. ^ It can be shown from Eq. (20.22) that the asymptotic 
form of Um may be written in the form of (20.9), where /c(0) is replaced by 


fm(e) = fciS) + y k-^(2l + 1)c*( 2’>/+5P sin 5iPz(cos 6) (20.24) 

1 = 0 

The differential scattering cross section is just |/m(0)|^, and in general con- 
tains interference terms between the Coulomb scattered amplitude fc{S) 
and the extra terms that are determined by the 8i. 

Classical Limit for a Pure Coulomb Field. As was discussed in Sec. 
12, we expect the results of quantum and classical theory to coincide 
whenever it is possible to construct wave packets that follow the classical 
trajectory without spreading appreciably and are small enough so that 
the forces are sensibly constant over their dimensions. A wave packet 
cannot be made much smaller than the reduced wave length 

X = A = A 

27r fiV 


of the relative motion if it is not to spread apart at once. For a repulsive 
Coulomb field, the distance over which the force varies by an appreciable 
fraction of itself may be taken to be of the order of the classical distance 
of closest approach. This also provides a useful estimate for an attractive 
field, since in all the collisions except the relatively few for which the 
particles are scattered through large angles, they never get closer than 
this distance from each other. 

Thus the classical limit corresponds to making X small in comparison 
with lZZ'eV^/xz; 2 |j or 


n = 


hv 


» 1 


Large n implies that the angle-dependent part of the phase of fc(6) given 
in (20.10) varies rapidly with 9, so that these rapid oscillations in the 

1 The computation of the Si in Eq. (20.22) requires knowledge of G at small r; 
useful formulas have been given by P. L. Yost, J. A. Wheeler, and G. Breit, Phys. Rev., 
49 , 174 (1936). 
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scattering amplitude should have little effect on the scattering when the 
colliding particles are identical (see Prob. 6, Chap. IX). 

It is interesting to note that for the Coulomb field the classical limit is 
approached for small v, whereas for potentials that have a finite range a, 
such as are discussed in Sec. 19, the classical limit is approached for 
\<^a, i.e., for large v. This is because the “size” \ZZ^e^/ ijlv^\ of the 
Coulomb field increases more rapidly than \ = h/ p.v v decreases. 

Problems 

1. Show that the coefficients of scattering by a one-dimensional square well poten- 
tial (like Fig. 14 except that Fo < 0) are given by Eqs. (17.5) if the sign of Vq is 
changed there and in the expression for a. Discuss the dependence of transmission 
coefficient on ^7 in this case. 

2. Show that Eqs. (18.4) and (18.7) are valid for a general binary collision if 
Y is given by (18.5) ; make use of conservation of energy and mass. 

3. Show that when a particle of mass mi collides elastically with a particle of mass 
m 2 that is initially at rest, all the recoil (mass m 2 ) particles are scattered in the forward 
hemisphere in the laboratory coordinate system. If the angular distribution is 
spherically symmetrical in the centor-of-inass system, what is it for m 2 in the labora- 
tory system? 

4 . What must Voa^ be for a three-dimensional square well potential in order that 
the scattering cross section be zero at zero bombarding energy (Ramsauer-Townsend 
effect) ? Find the leading term in the expression for the total cross section for small 
bombarding energy. 

6. State clearly the assumptions that go into the derivation of Eq. (19.31), and 
verify that it is a suitable approximation for the total cross section at low bombarding 
energies when the 1 = 0 wave is in resonance. 

6. Make use of Eq. (19.31) and the result of Prob. 5, Chap. IV, to obtain an 
approximate expression for the total scattering cross section by a particular potential 
in terms of the bombarding energy E and the binding energy e of a particle in that 
potential, when E and e are small in comparison with V 0 . 

7. Compute and make a polar plot of the differential scattering cross section for a 
perfectly rigid sphere when ka = using the first three partial waves {L = 0, 1, 2). 
What is the total cross section in this case, and what is the approximate accuracy of 
this result when the three t(^rms arc used? 

8. Find a gcMie.ral exj)resHion for the phase shift produced by a scattering potential 
FCr) = A/r‘\ where A > 0. Is tfic total c.ross .section finite? If not, does the diver- 
gence Gonu; from small or large scattering angles, and why? What modifications are 
necessary in the calculation if A < 0? Are any difficulties encountered in this latter 
case? 

9 . Protons of 200,000 electron-volts Cinergy are scattered from aluminum. The 
directly back scattered intensity (d = ISO^*) is found to be 96 per cent of that com- 
puted from the Rutherford formula. Assume this to be due to a modilication of the 
Coulomb potential that is of sufficiently short range so that only the phase shift for 
^ = 0 is affected. Is this modification attractive or repulsive? Find the sign and 
magnitude of the change in the phase shift for 1 = 0 produced by the modification. 



CHAPTER VI 

MATRIX FORMULATION OF QUANTUM MECHANICS 

In the last four chapters the Schrodinger wave equation was developed 
and its solutions obtained in some cases of physical interest. We now 
turn to a different formulation of quantum mechanics, in which dynamical 
variables such as the coordinates, momentum components, and energy 
of a particle appear explicitly in the equations of motion of the system 
without their having to multiply or differentiate a wave function. The 
classical equations are just of this structure, so it might be expected that 
there would be a closer resemblance between the classical and quantum 
formalism here than in the Schrodinger theory. 

This is actually the case; the principal formal difference is that the 
quantum dynamical variables do not obey the commutative law of 
multiplication. It is convenient to represent such noncommutative 
dynamical variables, which are often simply called operators, as matrices. 
Matrix theory provides an expecially flexible representation scheme, since 
there are an arbitrarily large number of ways of choosing the rows and 
columns of a matrix, all of which are of equal validity. It is because of 
the close formal resemblance between quantum matrix mechanics arid 
classical dynamics that this was historically the first formulation of 
quantum theory to be discovered, by Heisenberg in 1925.^ 

In this chapter we first review briefly the more important properties 
of matrices, and then show their connection with quantum theory and 
their usefulness in particular problems. 

21. MATRIX ALGEBRA 

We restrict our discussion at first to matrices that have a finite num- 
ber of rows and columns, and then indicate how the results obtained can 
be taken over to matrices that have an infinite number of rows and 
columns.^ 

1 W. Heisenberg, Zeits. f. Physik, 33, 879 (1925); M. Born, W. Heisenberg, and 
P. Jordan, Zeits. f. Phys. 36, 557 (1925). The connection between quantum matrix 
mechanics and the wave equation was established by E. Schrodinger, Ann. d. Physik, 
79, 734 (1926), and C. Eckart, P%s. Rev., 28, 711 (1926). 

2 For a fuller discussion, see J. von Neumann, Mathematische Grundlagen der 
Quantenmechanik,” Chap. II (Springer, Berlin, 1932; reprinted by Dover, New 
York). 
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Matrix Addition and Multiplication. A matrix is a square or rec- 
tangular array of numbers that can be added to or multiplied into another 
matrix according to certain rules. We denote a matrix by a capital 
letter, such as A, and the numbers or elements that make it up by the same 
letter with subscripts, such as Au\ here, k designates the row and I the 
column in which the matrix element Au appears. Two matrices can be 
added when they have the same rank, i.e., the same number of rows and 
the same number of columns. Addition is commutative: 

A + JS = B + A (21.1) 

If the sum matrix is called C, then 

Cki = Aki + Bki ( 21 . 2 ) 

A matrix A can be multiplied from the left into a matrix B if the num- 
ber of columns of A is equal to the number of rows of B ] then the product 
matrix C has the number of rows of A and the number of columns of B. 

C = AB, Cm = 2 (21.3) 

m 

where the summation is over the subscript m, which denotes the columns 
of A and the rows of B. It follows at once from Eqs. (21.2) and (21.3) 
that the distributive law of multiplication is valid. 

A{B + C) = AB + AC (21.4) 

Also, the associative law of multiplication is seen to be valid: 

A{BC) = (AB)C (21.5) 

where the left side means that A is multiplied from the left into the 

product of B and C, and the right, side means that the product of A and B 

is multiplied from the left ini.o C. The product (21.5) is written simply 
as ABC and, from (21.3), has the explicit expression 

D = ABC, Dm = ^ ^'‘■■‘B^^nCni (21.6) 

m,rt 

It is clear from (21.3) that AB is not in general equal to BA; thus 
the commutative law of multiplication is not generally valid. 

Null, Unit, and Constant Matrices. For an arbitrary square matrix 
A, the null matrix O is defined by the ecjuations 

Od = O, AO = O, (21.7) 

from which it follows that, all the elements of O are zero. If A is not 
square, the elements of O are still all zero, but the O’s that appear at 
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different places in (21.7) do not all have the same numbers of rows and 
columns. 

The unit matrix 1 is defined by 

\A = A, £1 = B, (21.8) 

for arbitrary matrices A and B. From Eqs. (21.8) it follows that 1 is a 
square matrix whose rank (number of rows or columns) equals the num- 
ber of rows of A or the number of columns of B. Moreover, 1 must have 
unit elements along its principal diagonal {k = 1) and zeros elsewhere, so 
that the elements of 1 equal the ICronecker symbol hi introduced in 
Sec. 10. ^ 

The product of a number c and a matrix A is the matrix cA that results 
from multiplying each element of A by c. Thus if we define a constant 
matrix C to be a multiple of a unit matrix so that each nonvanishing 
element is c instead of unity, then 

cA = (7A, where Cki ~ (21.9) 

are the matrix elements of the constant matrix (7. 

Spur, Determinant, and Inverse of a Matrix. The spur of a square 
matrix, often called the trace or the diagonal sum^ is the sum of the 
diagonal elements of the matrix: 

Sp{A)='^A^, (21.10) 

k 

The determinant of a square matrix is found from the usual rule for the 
computation of the determinant of a square array of numbers. 

A matrix A may or may not possess an inverse which is defined 
by the relations 

= 1, A-iA = 1 (21.11) 

A is said to be nonsingular if it possesses an inverse, and singular if it does 
not. If A is square, the kl element of its inverse is just the cofactor of 
Aik divided by the determinant of A; in this case A is singular if its 
determinant vanishes. It is readily verified that for nonsingular matrices 
A, £, C 

(A£C)-i = C-^B-^A-^ (21.12) 

Hermitian and Unitary Matrices. The Hermitian adjoint A* of a 
matrix A is the matrix obtained by interchanging rows and columns and 
taking the complex conjugate of each element; thus if 


B=A* then Bki-^Aik 


(21.13) 
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It is readily verified that the Hermitian adjoint of the product of a series 
of matrices is the product of their adjoints in the reverse order 

(ABC)* = C*jB*A* (21.14) 

A matrix is Hermitian or self-adjoint, if it is equal to its Hermitian 
adjoint; thus, A is a Hermitian matrix if 

A = A* (21.15) 

Evidently only square matrices can be Hermitian. 

A matrix is unitary if its Hermitian adjoint is equal to its inverse; 
thus A is a unitary matrix if 

A* = A-S or AA* = 1 and A*A = 1 (21.16) 

Unitary matrices need not be square. 

Transformation and Diagonalization of Matrices. We define the 
transformation of a square matrix A into a matrix A' by a nonsingular 
matrix S, by the following equation: 

SAS-^ = A' (21.17) 

It is evident then that transforms A' back into A. 

The form of a matrix ecpiation is unaffected by transformation. Thus 
the equation 

AB + CDE = F 

may be transformed into 

Nvi + BGDES-^ = SFS^^ 
which is e(iuival(‘nt. t o 

SAS-^ • • SDS-^ • SES-^ = SFS-^ 

or to 

/1'B' + C'D'FA = F' 

where the primes denote transformed matrices. This invariance of 
matrix equations with respect to transformations makes it possible to 
work with any convenient transformation of a set of matrices without 
affecting the validity of any results obtained. 

A square matrix is diagonal if it has nonvanishing elements only along 
the principal diagonal (/r = 1). The diagonal elements are then called 
the eigenvalues of the matrix. It is easily seen that the n\h. power of a 
diagonal matrix is also diagonal, and has as its eigenvalues the nth 
powers of the eigenvalues of the original matrix. The matrix A in Eq. 
(21,17) is said to be diagonalized, by the matrix S if the matrix A' that 
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results from the transformation is diagonal, so that A^i = A[^hh To 
find A' explicitly, we multiply (21.17) through on the right by 8. 

SA == A'S (21.18) 

The set of linear algebraic equations that are obtained by writing out 
the elements of Eq. (21.18) for a particular row k and all columns Z is 

^ SkmAml = A'j^kl or ^ Skm{Aml “ A'k^ml) = 0 (21.19) 

m m 


where is a particular eigenvalue of A' and the subscript m is summed 
over from unity to the rank N of the matrix A. 

Now (21.19) may be regarded as a set of N homogeneous algebraic 
equations for the transformation matrix elements Skm, where k is fixed. 
The necessary and sufficient condition that these equations have a solu- 
tion is that the determinant of their coefficients vanish, or that the 
determinant of the square matrix (A^i — Ak^mi) be zero. This provides 
a single algebraic equation, called the secular equation^ which is of order 
N and has N roots A*. Thus the eigenvalues of the diagonal matrix A' 
resulting from A by transformation are the same no matter how A is 
diagonalized, except perhaps for the order in which they are arranged; 
for this reason they are also called the eigenvalues of the original nondiag- 
onal matrix A, 

Matrices of Infinite Rank, The rules (21.2) and (21.3) for addition 
and multiplication of matrices may be taken over in an obvious way for 
matrices that have an infinite number of rows and columns, provided 
that the infinite sum in (21.3) converges. We sometimes deal with 
matrices that have a nondenumerably infinite number of rows or columns 
or both; in this case, one or both of the matrix subscripts becomes 
a continuous variable, and the usual summation over a subscript is 
replaced by integration over this variable. We do not consider these 
possibilities in detail here, but simply assume that all reasonable results 
can be taken over from finite- to infinite-rank matrices without difficulty.^ 
The statement that a Hermitian matrix of infinite rank is square means 
that its rows and columns are labeled in the same way. A unitary 
matrix of infinite rank need not be square. Its rows and columns can 
be labeled differently; for example, the number of rows may be denumer- 
ably infinite and the number of columns nondenumerably infinite. 

We are concerned primarily in quantum mechanics with Hermitian 
and unitary matrices, mainly of infinite rank. A fundamental theorem 


^ A more thorough discussion of this point, and a proof of the following theorem, 
are given by J. von Neumann, loc. dt. 
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that we shall assume without proof is that any Hermitian matrix can be 
diagonalized by a unitary transformation; a corollary of this theorem is 
that the resulting eigenvalues of the Hermitian matrix are unique, 
except perhaps for the order in which they are arranged. Starting from 
this theorem, it is not difficult to show (see Prob. 1) that the necessary 
and sufficient condition that two Hermitian matrices can be diagonalized 
by the same unitary transformation is that they commute (matrices A 
and B commute if AB = BA). 

It also follows from this theorem that the eigenvalues of a Hermitian 
matrix are real. If the S and A in (21.17) are unitary and Hermitian, 
respectively, that equation can be rewritten 

SAS* = A' (21.20) 

The Hermitian adjoint of Eq. (21.20) is, from (21.14), 

SAS^ = A'* 

Since then A'* = A', this shows that the Hermitian property is 
maintained during transformation by a unitary matrix. If A' is diagonal 
as well as Hermitian, it follows from (21.13) that its eigenvalues are real. 
It is easily seen that the converse is also true: a matrix that has real 
eigenvalues is Hermitian. 

It is important to note with matrices of infinite rank that both of Eqs. 
(21,11) must be valid in order for A~^ to be the inverse of A. Similarly, 
both of the latter pair of Eqs. (21.16) must be valid if A is to be unitary. 

22. MATRICES IN QUANTUM MECHANICS 

The appearance of matrices in quantum mechanics can be connected 
in a simple way with the solution of the Schrodinger equation (8.2). In 
this section we adopt the Hamiltonian notation, and justify it in detail 
in Sec. 23. We rewrite Eq. (8.2) as 

Hu,{t) = E,Mr) (22.1) 

where the subscript k denotes the different members of the complete 
orthonormal set of energy eigenfunctions Uk{r) and their corresponding 
eigenvalues Ek [/r specifies the energy, and also distinguishes between 
degenerate eigenfunctions; it thus includes both E and 5 of Eq. (10.7)]. 
The Hamiltonian or energy operator 11 is given by 

" “ s + (“'2) 

In accordance with the discussion of Sec;. 8, k may be a discrete or a 
continuous variable, or discrete over part of its range and continuous 
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over the rest. We shall use the symbol S ov to denote a complete 
summation over the subscript k (the summation includes an integration 
Jdk over the continuous part of its range). 

A Unitary Transformation Matrix. Suppose that we have a second 
complete orthonormal set of functions 2 ^«(r), which are not necessarily 
eigenfunctions of the Schrodinger equation (22.1) with the actual poten- 
tial energy F(r); they might, for example, be momentum eigenfunctions 
(11.4) or (11.11), or hydrogen-atom wave functions (16.24) supplemented 
by continuous-eigenvalue Coulomb-field wave functions like those dis- 
cussed in Sec. 20. The Vn can be expanded in terms of the Uki 


where it is readily verified from the orthonormality of the Uk that 


Similarly 


Skn = JUk('r)VniT)dT 


(22.3) 


We can now see that the matrix of which Skn are the elements is 
unitary. 

iSS*)ki = S’^knSin = 

= JK(r)«,(r')5(r - ndrdr' = fu,{T)uiir)dT (22.4) 

where use has been made of the closure property that is possessed by any 
complete orthonormal set of functions such as the Vnir) [see the discussion 
of Eq. (10.11)]. The last integral on the right side of (22.4) is a Kronecker 
8 symbol or a Dirac 8 function according as k is one of a discrete or a con- 
tinuous set of subscripts; in either case, this integral is equivalent to an 
element of the unit matrix, so that we have shown that SS* = 1. In 
similar fashion, it can be established that 




= Sk^knSkm = ( 1 ) 


nm 


Hamiltonian Matrix. The functions Vniv) can be used to calculate a 
Hamiltonian matrix 

Hnm = !vn{r)Hvrn(r)dT (22.5) 

where H is the operator given in (22.2). We now consider the connection 
between the Hamiltonian matrix (22.5) in the Vn representation^ and the 
energy eigenvalues 

Transformation of Hnm by the unitary matrix S gives 


{SHS*)u = 
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where the prime on H indicates that it operates only on the variable r' 
to its right. After performing the summation over the subscript m to 
get 5(r' — r"), it can be shown by means of two partial integrations that 

- T")ui{T")dT" = iuj{r"W'd(r' - x")dT^' 

= J5(r' - r")H'%ii'')dT" 

= H'uiii') 


On doing the sum over n and dropping the primes, we obtain 

{SHS^),i = iii,{t)Hui(r)dT = Ek8,i or E^dik - 1) (22.6) 

which is in diagonal form with the eigenvalues Ek. 

Thus the problem of solving the Schrodinger equation is completely 
equivalent to the problem of diagonalilzing the Hamiltonian matrix when 
it is expressed in some arbitrary representation such as that provided by 
the functions Vn. The eigenvalues of the Hamiltonian matrix are the 
energy eigenvalues of the Schrodinger equation, and the unitary trans- 
formation matrix S that diagonalizes H serves, through Eq. (22.3), 
to give the energy eigenfunctions Uk{r) in terms of the arbitrary initial set 
of functions. 

It is interesting to note that S is not necessarily a square matrix. For 
example, the functions Vn may be the eigenfunctions of a three-dimensional 
harmonic oscillator, which form a completely discrete set, while the func- 
tions Uk may be momentum eigenfunctions, which form a completely 
continuous set. However the Hamiltonian matrix, in both its diagonal 
form (22.6) and its nondiagonal form Hrm, is square. 

Dynamical Variables as Hermitian Matrices. The eigenvalues of the 
Hamiltonian are found in (22.6) to be the real energy levels Ek. It then 
follows from Sec. 21 that H is a Hermitian matrix in any representation. 

The discussion of Sec. 10 developed the interpretation that the eigen- 
values of any operator that represents a dynamical variable are real, since 
they are the only possible results of precise physical measurement of that 
variable. Any dynamicial variable can be represented as a matrix whose 
diagonal representation has its eigenvalues along the principal diagonal, 
and that can be transformed to any other representation by a suitable 
unitary matrix. Thus any physically measurable dynamical variable 
can be represented by a Hermitian matrix, and is said to be Hermitian. 

Wave Functions as Unitary Matrices. Any complete orthonormal set 
of functions, such as the ^^^(r) or the Vnir)^ possesses the closure property 


- r') 


(22.7) 
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and the orthonormality property 

juk(t)ui(x)dr = Ski or 8(k — 1) (22.8) 

If now we regard the function Ukix) as a two-dimensional array of 
numbers in which the rows are labeled by the position variable r and the 
columns by the subscript k, it is equivalent to a matrix Uik- Equation 
(22.7) is then equivalent to the matrix equation 

(UU*)„' = = ( 1 )„< 

Similarly, Eq. (22.8) is equivalent to 

iU*U)u = SrUrkU,i = (1)h 
Thus U is 8b unitary matrix. 

The r Representation. This suggests that such a unitary matrix be 
used to effect a transformation. Suppose that we transform the Hamil- 
tonian matrix Hnm given in (22.5) by means of the unitary matrix 

Vtn = Vnir). 

(FH7*)„' = = Sn,mVn(T)HnmVm(r') 

= /6(r - - i')dT" = HSir - r') (22.9) 

The last step makes use of partial integrations and implies that H operates 
only on the variable r in 5(r — r'). • 

The result embodied in (22.9) is the inverse of the definition (22.5). 
The latter takes a differential operator H and constructs from it a matrix 
representation, while the former transforms that matrix back into what is 
effectively the differential operator. However, we now see what was not 
so obvious from the form (22.2) for H: that a differential operator on the 
space coordinates can be expressed as a matrix in a representation in 
which the rows and columns are labeled by the position variables r and r'. 
From this point of view, the solution of the Schrodinger differential wave 
equation (22.1) is equivalent to the diagonalization of the matrix 

= Hd{x - X') 

just as we saw above that it is equivalent to the diagonalization of the 
matrix H nm> 

The r representation is one in which the coordinate r is diagonal: 
(r)rv' = r'5(r' — r'^). It is worth pointing out explicitly that the Hamil- 
tonian Hrr' in the r representation is not diagonal, even though the 5 
function makes it vanish when r differs from r' by a finite amount. This 
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is because derivatives of the 5 function appear, and these have nonvanish- 
ing matrix elements infinitesimally removed from the diagonal r == r'. 
For example, the matrix /(r) 5 (r — r') is diagonal, but the matrix 

(d/dx) 8(r — r') 

and the matrix V25(r — r') are not. 

A Useful Identity. If Q is an operator whose effect on a function /(r) 
can be represented as 

m) = J12(r,r')/{r0rir' 

then we can establish an identity that is sometimes useful : 

igmmw = /[OW]/(r)dr (22.10) 

If we regard 12(r,r') as a matrix in the r representation, the Hermitian 
adjoint of Q operating on ^(r) is 

from which Eq. (22.10) follows at once. 

This identity enables us to transfer operators from one factor of an 
integrand to another. The partial integrations in Eqs. (7.9), (7.10), 
and (12.3) are examples of the application of (22.10). In those cases, 
0 is a differential operator, and its matrix representation involves deriva- 
tives and multiples of the 5 function. However, Q need not be of this 
specialized type [it might, for example, be an integral operator like the 
square bracket in Eq. (10.19)], and (22.10) is valid as well for operators 
that have non vanishing matrix elements a finite distance from the 
diagonal. 

23. EQUATIONS OF MOTION IN MATRIX FORM 

In the preceding section we discussed the principal transformation 
properties of matrices that represent dynamical variables at a particular 
instant of time. We now find equations of motion for these variables by 
computing the time rate of (diange of their matrix representations. The 
resulting equations are very similar in form to the classical eciuations of 
motion and indicate a general procedure for quantizing any classical 
system. 

Time Rate of Change of a Matrix. We take as our starting point the 
time-dependent Schrcklinger wave ecpiation (6.16), expressed in terms of a 
Hamiltonian operator. 


(23.1) 
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where a typical F is that given by Eq. (22.2). A matrix representation 
for any function F of the dynamical variables can be found in terms of a 
complete orthonormal set of functions, each of which depends on the 
time in accordance with (23.1). We call any two of these functions ^ 
and 4>, and calculate the time rate of change of a typical matrix element 

= nKUW{tis:'M{T',t)dTdT' 

It is assumed that E is a general operator (not necessarily a differential or 
multiplicative operator) of the type considered in Eq. (22.10), which mav 
depend explicitly on the time. mca may 

Differentiation with respect to t gives 


~ j mdr = 11 ^(r,0 [|E(r,r',o] 
+ y/ <?(r,0E(r,r',0 


rf'(T',t)dTdr' 
^ ',t) 


dt 


drd-F 


+ 


= j .?(r,0E(r,r',0[f/V(r',f)]drd/ 

f I ’ substitution has been made from Eq. (23.1) We make use 

E^fn thetsft^"'-'"^ r ^ (which\ Wrmitian W 

/'vJ' in the last term, and obtain the equation 


It / = j 


- dF 1 


KFH - HF)Ht 


tunli'oL ■“ ?* »i»“ ‘he 

functions f and 4 , are quite arbitrary at any instant of time; 
dF dF 1 

dt ~ It (23.2) 

Jhanet^of elements are the time rate of 

dSivlo^r r* the total time 

exn^-t H ^ to f, and takes into account the 

explicit dependence of E on the time. The last term is that part of the 

time derivative of the E matrix that arises from the change in time of 
the functions with respect to ^vhich the matrix is calculated Equation 
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(23.2) is Heisenberg^s form of the equation of motion of a dynamical 
variable. 

Classical Lagrangian and Hamiltonian Equations of Motion. In 
order to bring out the similarity between Eq. (23.2) and the corresponding 
classical equation, we review briefly the structure of classical Hamiltonian 
theory. The equations of motion of a conservative dynamical system 
that has / degrees of freedom may be derived from a Lagrangian function 
^ Qfj ih - j Qf} 0 of the coordinates qi, the velocities 
qi = dqijdtj and the time, by means of a variational principle:^ 


5 j^^ Ldt = 0, Bqi{ti) = 8qi(t2) = 0 
The resulting Lagrangian equations are 



(23.3) 


(23.4) 


If now we define a momentum canonically conjugate to qi as Pi = dL/dqi, 
and a Hamiltonian function of the coordinates and momenta as 


j 

■ ■ ■ qf,Pi • • • 'p/,t) = ^ Piqi - L 


variation of H leads to the Hamiltonian equations of motion 

dH 


dH 
Qi ^ 

opi 


= - dq, 


1 


• / 


(23.5) 

(23.6) 


The time dependence of any function of the coordinates, momenta, and 
the time, calculated along a moving phase point, is 



• Qf, Vi • 



dt Z/ \dqi dpi dqi dpj 
1 = 1 


on making use of the Hamiltonian equations (23.6). 'Fhe Poisson bracket 
{ of any two functions of the coordinates and momenta is defined as 


\A,n} 


! 



dqi dpi 


1=1 


dqi dpij 


(23.7) 


‘ E. T. Whittaker, “Analytical Dynamics,” 3d ed., Hecs. 99, 109 (Cambridge, 
London, 1927). 
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In terms of the Poisson bracket, the equation of motion for the function 
F of the dynamical variables becomes 

(23.8) 

The left side of Eq. (23.8) is the total time derivative of F along a 
moving phase point, the first term on the right side takes into account the 
explicit time dependence of F, and the last term shows the change in F 
due to the motion of the phase point at which F is evaluated. Thus there 
is a strong resemblance between Eqs. (23.2) and (23.8); the effect of 
the moving phase point in the latter corresponds to the effect in the 
former of the change with time of the functions that specify the matrix 
representation. 

Poisson Brackets and Commutator Brackets. The resemblance 
between Eqs. (23.2) and (23.8) suggests that the quantum analogues of 
the classical equations of motion be found in general by substituting the 
commutator bracket divided by ih for the Poisson bracket: 

{A,B} i [A,B] ^ ^ (AB - BA) (23.9) 

There are two observations that lend support to this suggestion. The 
first concerns the classical conditions for a contact transformation from 
one set of canonical variables qi^pi to another Qi,Pi:^ 

{Qi,PA = 5./, IQiM = 0, \Pi,PA = 0 (23.10) 

where the Poisson brackets are calculated with respect to the original 
variables qi,pi. Now we saw in Sec. 6 that a successful transition from 
classical to quantum theory could be made by substituting the differential 
operator —ih{d/dx) for pxj etc. The commutator of x and can then be 
found by letting it operate on an arbitrary function gr(r) of the coordinates. 

{x'Px — Pxx)g(r) = —ihx^ + (xg) = ihg{r) (23.11) 

Since ^(r) is arbitrary, this and the other commutators may be written 
as operator equations 

dx 

A - A 

dy dy 

xy - yx = 0, p:,py - pyp:, = 0, etc, 

1 Whittaker, op. cit, pp. 300, 307, 


xpy 


— PyX = —ih 



XP: 
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These are in agreement with the classical equations (23.10) when the 
substitution (23.9) is made. 

The second observation is that the algebraic properties of the com- 
mutator brackets are identical with those of the Poisson brackets. It is 
readily verified from the definition (23.7) that 

{A,B} = [AjC] =0, where c is a number 

{(Ai + A 2 ), B\ = {Ax,fi} + {A,,B} (23.12) 

{AiAo,J5} = {Ax,B}A2 + Ai{A2,J5} 

[AAB,C}] + {BAC,A}\ + {CAA,B}} =0 
The order of possibly noncommuting factors has not been altered. 
Dirac ^ has shown that the form of the quantum analogue of the Poisson 
bracket is determined by Eqs. (23.12) to be the right side of (23.9); the 
constant h is, of course, arbitrary so far as this discussion is concerned 
(see also Prob. 10). 

Quantization of a Classical System. It is plausible on the basis of the 
preceding discussion to assume that any classical system can be taken 
over into quantum mechanics by finding the classical Plamiltonian func- 
tion and equations of motion in terms of some set of canonical variables 
QijPi, and replacing the Poisson brackets in (23.8) and (23.10) by com- 
mutator brackets in accordance with (23.9). The canonical variables are 
then found to obey the quantum conditions 

fe.Pj] = [gi,(Zy] = 0, [p,p,] = 0 (23.13) 

This quantization technique will be found useful in connection with 
classical wave fields (Chap. XIII) as well as classical particles. 

Two precautions are found necessary in applying this technique. 
First, the coordinates and momenta must be expressed in cartesian coordi- 
nates. And second, ambiguities in the order of noncommuting factors 
are usually resolved by taking a symmetric average of the various possible 
orders. These precautions are illustrated in the following example. 

Motion of a Particle in an Electromagnetic Field. As an example of 
the foregoing quantization technique, we consider the problem of the 
motion of a charged mass point in a general external electromagnetic 
field. The classical Hamiltonian, expressed in terms of the canonical 
variables r, p and the electromagnetic potentials A(r,^), (/)(r,i), is^ 



(23.14) 


^ P. A. M. Pirao, '‘The Principles of Quantum Mechanics, 2ded., Sec. 25 (Oxford, 
New York, 1935). 

2 J. H. Van Vleck, '‘The Theory of Electric and Magnetic Susceptibilities,’’ pp. 
7, 20 (Oxford, New York, 1932). Gaussian units arc used throughout the present 
book. 
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where e is the charge on the particle and c is the speed of light; the electric 
and magnetic field strengths are given in terms of the potentials by 

E = — ~ ^ — gi’ad <#>, H = curl A (23.15) 

C 06 

The quantum conditions (23.13) in cartesian coordinates are 

= [y,Vv] = (23.16) 

with other pairs of coordinate and momentum components commuting. 
We now use Eq. (23.2), with the expression (23.14) for H and the relations 
(23.16), to calculate expressions for the particle velocity dr/dt and 
acceleration dhtdt^, for comparison with the corresponding classical 
expressions. 

Evaluation of Commutator Brackets. In order to facilitate evalua- 
tion of some of the commutator brackets that arise from substitution 
into (23.2), we derive a few elementary results. Any two functions of r 
commute with each other, since all components of r commute with each 
other. It follows from (23.16) that 

x^Vx — = x{p^ ih) — 

= (Px^ + ih)x + ihx — pxX^ = 2ihx 

It is readily shown by induction that 

x^Px — PxX^' = nihx^^~^ (23.17) 

It follows from (23.17) that for any function /(r) that can be expressed 
as a power series in the relation 

[/(r),Px] = f{i)Px - Pxfir) = (23.18) 

is valid. ^ Equation (23.18) can also be established for more general 
functions than power series by making use of the representation of p® 
as —ih{d/dx), as in (23.11); if we operate with the left side of (23.18) 
on an arbitrary function ^(r), we obtain 

[/(r),pj^(r) = -ih f{r)A - A/(r) g^r) = ^(r) [*^£/(r)] 

which is equivalent to the operator equality (23.18) since fif(r) is arbi- 
trary. By repeated application of (23.18) it is easily shown that 

f(x)pl - pim = ih ^ = 2ih Px + *“ ^2 (23.19) 

'This corresponds to the classical relation (/(r),px! = df{T)/dx [compare with 
Eq. (23.9)1. 
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Velocity and Acceleration of a Charged Particle. The Hamiltonian 
(23.14) may now be written, with the help of (23.18), 

»2 - >>2 


ieh 




= ^ A -p + div A + 5 ^, 

2m me ^ 2mc 2mc^ 


A2 + e<t> (23.20) 


The time derivative of a component of r is then easily shown from (23.2) 
to be 

dx 


dt 


= — (vx — 

m \ c / 


(23.21) 


in agreement with the classical relation between the velocity and momen- 
tum of a particle in the presence of an electromagnetic field. 

The calculation of a component of the acceleration of the particle 


dH 

dt^ 


I 

m 


dpx 

dt 


X 


e dAg 
c dt _ 
e dAg 


ihm me dt 


• [Ax^H] 


ihmc 

is straightforward, but rather tedious. The result may be written 

e 


d^x _ _ ^ A I 
dt^ ~~ m \c dt dxj 


+ 


The Lorentz Force. liquation (23.22), with the similar y- and 
2 -component equations, can be written as a single vector equation for the 
'^force^' 


m 


dt^ 


+ i 

= eE +i 


e fdt 
c \dt 


X H - H X 


dr 

di. 


0 


(23.23) 


where use has been made of Eqa. (23.15) and (23.21). Equation (23.23) 
is in agreement with the corresponding (dassical expression 


eE + ; (V X H) 
c 
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where v = dx/dt is the velocity of the particle, if we take a symmetric 
average of the two terms v X H and — H X v ; these are identical clas- 
sically but differ in quantum mechanics since the v given by (23.21) does 
not commute with H. 

Equation (23.23) includes a generalization of Ehrenfest^s theorem, 
which was discussed in Sec. 7. If we consider a diagonal element, the 
left side is the product of the mass and the second time derivative of the 
expectation value of the position vector of the particle. The right side 
is the expectation value of the Lorentz force acting on the charge of the 
particle. Thus (23.23) states that a wave packet moves like a classical 
particle if it is sufficiently well localized so that the electromagnetic fields 
change by a negligible amount over its dimensions. This result can, of 
course, also be obtained by the method of Sec. 7 when, in accordance 
with (23.1) and (23.20), the Schrodinger wave equation is taken to be 

s'” + + 2^! 

Constants of the Motion. Equation (23.2) tells us that if F does not 
depend explicitly on the time (so that SF/dt = 0), then dF/dt = 0 if F 
commutes with H. In this case, F is said to be a constant of the motion. 
This is usually possible at all times only if H is also constant. If we sub- 
stitute H for F in (23.2), we see that the constancy of H implies that 
dH/dt = 0, or that H does not involve the time explicitly. Thus if H 
is independent of t, a function F of the dynamical variables of the system 
is a constant if it does not depend on t and commutes with H. 

An example of a constant of the motion is any one of the coordinates or 
momenta of the system whose canonically conjugate momentum or coordi- 
nate does not appear explicitly in H. Since the dynamical variable in 
question commutes with all other variables except its own canonical con- 
jugate, then it commutes with H. Thus if the PTamiltonian for a number 
of interacting particles does not depend on the position coordinates of 
the center of mass of the system, the total momentum of the system is a 
constant of the motion. This is in agreement with the classical result 
that the total linear momentum of a system of interacting particles is 
constant if there are no external forces. 

In similar fashion, we can make use of the third of Eqs. (14.20) to 
find the condition for constancy of the angular momentum of a particle. 
This equation states that the operator that represents the z component of 
angular momentum is = ih(d/d4>), where <i> is the angular coordinate 
of rotation about the z axis. Thus, as with (23.11), we have the operator 
relation 


<f>M, - M,(i> = ih 


(23.25) 
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and 0 and Mz may be regarded as canonically conjugate variables. If 
then H does not depend on the angular coordinates of a particle [for 
example, if V in Eq. (22.2) depends only on the radial distance r from a 
fixed center], Mz is a constant of the motion; since there is nothing special 
about the choice of the z axis, Mx and My are also constants. This is 
in agreement with the classical result that the angular momentum of a 
particle moving in a central field of force is constant. 

The Parity Operator. Parity was first discussed in Sec. 9, and defined 
there and in Sec. 14 as the property of an energy eigenfunction that speci- 
fies whether it is even or odd with respect to changes of sign of all of the 
space coordinates. The parity operator P can be introduced in quantum 
mechanics, even though it has no classical analogue; it is defined as an 
operator that reflects all coordinates of all particles through the origin: 

P/(a;i, 2/1, 31,3:2,2/2,22, .•■,<) 

= /(— 3 : 1 ,— 2/1,— 2i,— a:2,— 2/2,-22, ■ • ■ ,i) (23.26) 

It is evident from (23.26) that P^ is the unit operator 1. Thus if P is 
diagonalized the sijuare of each of its diagonal elements is unity, and its 
eigenvalues are + 1. 

If now II is left unchanged by reflection of all coordinates through 
the origin, P commutes with II and is a constant of the motion. Moreover, 
it follows from Sec. 21 that P and H can be made diagonal at once. Then 
the parity of an cnei'gy eigenfunction can be well defined (even or odd) 
and is constant in time. 

Energy Representation. It was shown in Sec. 22 that the set of 
Schrodinger energy eigenfunctions UA.(r) may be thought of as a unitary 
matrix that transforms the Hamiltonian operator from the r representa- 
tion into the diagonal form 


H,t = or E,d{k - 1) 


Although the disimssion of Sec. 22 is valid for only one instant of time, it 
can be made valid foi' all time by using the time-dependent eigenfunctions 

Uk{r)e for t,ho transformation, provided that // is a constant. The 
matrix represc'iitation in which H is diagonal is called the energy 
representation . 

The equation of motion (23.2) takes a particularly simple form in this 
representation, if A" does not depend explicitly on the time: 

= .1 (FII - IIF),, -Ue,- Ei)Fu 
(It in ri 


(23.27) 
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Equation (23.27) can be integrated to give 

i{Ek-Ei)t 

Fuit) - Fy (23.28) 

where Fh is the value of the matrix element at ^ = 0. Thus in the energ}^ 
representation, the off-diagonal matrix elements of any time-independent 
function of the dynamical variables oscillate in time with frequencies that 
are related to the energy differences between the stationary states by 
Bohr’s frequency condition (see Sec. 2). 

24. ANGULAR MOMENTUM 

As an interesting and useful example of the direct treatment of 
dynamical variables by matrix methods, we now consider the properties of 
the angular-momentum operator. We work entirely with a representa- 
tion at a particular instant of time, so that we are not concerned here 
with the change in time of the resulting angular-momentum matrices. 
However, if the angular momentum commutes with the Hamiltonian, it is 
a constant of the motion, and the matrices retain their form for all time. 
It was shown in Sec. 23 that this is the case if the Hamiltonian is spheri- 
cally symmetric. 

Definition of Angular Momentum. We define the angular momentum 
M of a particle about some point in terms of its displacement r from that 
point and its momentum p as in Eq. (14.19) 

M = r X p (24.1) 

We do not now require that p be expressible as a differential operator, but 
rather that the components of r and p satisfy the commutation relations 
(23.16). It is then possible to find commutation relations between the 
components of M that do not involve r and p. 

= (2/p. - ZPy){ZVx - XPz) - {zp„ - XPz)iyPy — ZPy) 

= yPxiPzZ — zpz) + xpy{zpz — p^) = ihixpy — ypz) 

We thus obtain 

= ihM,, [My,M.] = ihM,, = ihMy (24.2) 

Equations (24.2) are seen to apply also to the total angular momentum 
of a system of particles, since the r and p operators for the individual 
particles commute with each other and so their angular momenta do also. 
Now it turns out that the relations (24.2) admit of some matrix repre- 
sentations for M that are not compatible with the original definition 
(24.1). When such a conflict arises, there is in some cases physical reason 
to regard (24.2) as being more fundamental than (24.1) (see the end of 
this section). 
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We note that the original definition (24.1) makes M Hermitian, since 
r and p are Hermitian. This is assumed to be true in general, since the 
components of M can be Hermitian without violating the commutation 
relations (24.2). 

Choice of a Representation. It is apparent from (24.2) that no two 
of the three components of M commute with each other, and so it is 
impossible to find a representation that diagonalizes more than one of 
them. However, all three components commute with 

M2 = Ml + Ml + Ml 

for example, 

[M.,M2] = M,Ml - MIM. + M,Ml - MIM^ 

= ih{M^My + MyM:,) - ih{MyM^ + MMy) = 0 

Thus it is possible to diagonalize one component of M, say M*, and 
at the same time; we use this to define our representation. 

It is then convenient to work with M^ and the non-Hermitian matrix 

+ iMy (24.3) 

from which it follows that 

M2 = Ml + i(LZ.* + L*L) (24.4) 

The commutation relations involving L are found from (24.2) to be 

[M'\L] = 0, [MJA = fiL, [L,L*] = 2hM. (24.5) 

Our object is to find a representation in which Mz and are diagonal. 
The rows and columns of this representation can be labeled by the eigen- 
values of these two dynamical variables, and we wish to determine these 
eigenvalues along with the corresponding matrix for L. The matrices 
for Mx and My are then given by solving (24.3) and its Hermitian adjoint 
equation : 

Mx = i(/^* + h), My = ~ L) (24.6) 

Relations between Matrix Elements. We label the rows and columns 
of our representation with a pair of symbols m and j. The eigenvalues of 
Mz are mh, so that ??? is a dimensionless real number; j is related to the 
eigenvalues of M^ in a way that is specified below [see Eq. (24. 13)]. Then 
the first of Eqs. (24.5), written in matrix form, is 
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If we work out the matrix products and remember that is diagonal, 
this becomes 

m^)r - = 0 (24.7) 

where is the eigenvalue of that corresponds to f. Equation 

(24.7) tells us that all matrix elements of L vanish except those for which 
f == Thus we can temporarily ignore j in specifying the L matrix, 
and remember when we finish that the matrix with which we come out is 
all for a particular value of j. This may be part of a larger matrix in 
which j can have various values, but there are no nonvanishing off- 
diagonal elements of L between the sections of the matrix that correspond 
to different values of j. 

It is therefore sufficient for the present to use only the symbol m to 
designate rows and columns of L. The second of Eqs. (24.5) then gives 

zL') m ' ~ 

or, since Mz is diagonal with eigenvalues mA, 

(m' - (24.8) 

Thus the only nonvanishing matrix elements of L are those for which 
m' = m" + 1, and we denote these by 

= \Ji (24.9) 


so that \m is a dimensionless number that may be complex. 

If now we take the mth diagonal element of the third of Eqs. (24.5), we 
obtain 





Each sum is seen to contain only one nonvanishing term, so that this 
equation becomes, with the help of (24.9), 


\Xm^i\"- — = 2m 


(24.10) 


Eigenvalues of Mg. Equation (24.10) is a first-order linear difference 
equation in [XmlS and its general solution has one arbitrary constant 


|X,^|^ = C — m(m + 1) (24.11) 

Now \\m\^ is necessarily positive or zero, and yet the right side of (24.11) 
evidently attains negative values for suflSciently large positive and nega- 
tive values of m. This does not cause difficulty if there are two values 
mi and m 2 of m for which X^ = 0, and if these two values differ from each 
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other by an integer. If this is the case, the series of 'vn values in which 
successive terms differ by unity can terminate at both ends without 
1X^1^ becoming negative. Equation (24.8) can be satisfied at the upper 
end (m = mi) by having = 0 rather than by having an eigen- 

value greater than mi, and (24.8) can be satisfied at the lower end 
(m = m 2 ) by having L^,+i. = 0 rather than by having an M. eigen- 
value less than m 2 + 1. \\m[^ is evidently nonnegative for m values that 

range from m 2 + 1 to mi, inclusive. 

We thus have a finite series of eigenvalues of ikf* ranging from mi 
down to m 2 + 1 by unit steps, where mi and m 2 are the larger and smaller 
roots of the quadratic equation C — m(m +1) =0: 

mi = — ^ -^(1 4(7)-‘, m 2 = ■ 5 ' ”1“ 4(7)^ 

We rename mi and call it j, in which case C = j(j + 1), and the eigen- 
values of Mz range from j to —j by unit steps. This implies that 2j is a 
positive integer or zero, or that j is restricted to the series of values 0, 

113 

■J, i,^, . . . . ^ 

Eigenvalues of M^; the L Matrix. Equation (24.11) can now be 
rewritten in terms of j 

= i(i + 1) — ‘Wi(m + 1) = (i — m)(j -h m + 1) (24.12) 

The eigenvalues of can be found by calculating a diagonal element of 
(24.4): 

(M2),, = {m2 + i[i(i + 1) ~ (m. - l)m + j{j + 1) - m(m + l)]\h^ 

= j{j+l)h:^ (24.13) 

We thus have an infinite number of representations for the matrices 
M2, Mz, and L, each of which is characterized by a zero or half-integer 
value for j and has 2j + 1 rows and columns. As expected, the eigen- 
values of M2 are all the same for a particular value of j. All of these 
representations may be taken together to form a single representation of 
infinite rank, although it is often more convenient to consider them 
separately. 

Equation (24.12) leaves the phase of the matrix elements of L arbi- 
trary. This corresponds to an arbitrariness in the choice of the phase of 
the angular-momentum eigenfunctions when they are normalized, and 
is not of physical significance. We therefore choose all the phases to be 
zero, and obtain for the nonvanishing matrix elements of L 

= [U - m){j + m + l)]^h (24.14) 

For j = 0, M2 and the components of M are all represented by null 
matrices of unit rank: (0). The matrices for the next three values of j. 
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as obtained from Eqs. (24.6), (24.13), and (24.14), are 


i = 


3 = 1 


3 = 


M, 




= o) 


o) 

= J) 


M, = 


M. = ih\ 0 


(24.16) 



Comection with the Spherical Harmonics. Comparison of the 
foregoing results with those of Sec. 14 suggests a close connection between 
t e matrix representations of the angular momentum for which j = I 
IS an integer, and the spherical harmonics Yi,„{e,4>) defined in (14 6) A 
comparison of Eqs. (14.22) and (14.23) with the and M. matrices 
shows that the corresponding angular-momentum operators of Sec. 14 
are simply^ another representation of the matrices considered in this 
section. Ihis can also be shown for the L matrix by computing the 
effect of operation with Z on a spherical harmonic. From (14.20) and 
(24.3) , the L operator in polar coordinates is seen to be 


A = *e'>(±+icot0A^ (24.16) 

It is then possible, by making use of the properties of the spherical har- . 
monies discussed in Sec. 14, to show that 

LYi^(d,<f>) = ±[(Z - m){l + m + 1)]1 hYi,„+i(e,4,) (24.17) 
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where the minus sign is for m 0 and the plus sign is for m < 0 (the 
calculation has a somewhat different structure for the two groups of m 
values). Thus if we set up a matrix to represent the operator L in 
analogy with Eq. (22.5), we get just the matrix (24.14) with integer j, 
except for change of some of the signs, which are arbitrary anyhow. 

In similar fashion, we can show' in analogy wuth Eqs. (22.7) and (22.8) 
that Yim{Q^<i>) can be regarded as a unitary matrix that transforms from a 
representation in whic^h the rows and columns are labeled by the angle 
variables to one in wdiich they are labeled by the quantum numbers 
Z,m. The first representation is that which is implicit in the work of 
Sec. 14, and in which L w'ould be represented as the result of operating 
with (24.16) on a suitably normalized h function of the angles. The 
second representation is that developed in this section, in which L is 
given by (24.14) except for some sign changes. 

Spin Angular Momentum. It follows from the work of Sec. 14 and of 
this section that if all of the integer j representations are taken together to 
form a single representation of infinite rank, the M matrices can be 
expressed by (24.1) in terms of r and p matrices that satisfy the commuta- 
tion relations (23.16). This is not true of the half-odd-integer j matrices, 
which are solutions of Eqs. (24.2) but not of the more restrictive Eqs. 

(24.1) and (23.16). Thus the matrices for the components of the orbital 
angular momentum of a particle or a system of particles must have eigen- 
values that are integer multiples of h, since they must be expressible in 
terms of coordinates and momenta. 

There is nothing to prevent a particle from having an intrinsic angular 
momentum that is described by P]qs. (24.2) but cannot be expressed in 
terms of the position and momentum of the particle in accordance with 

(24.1) . The eigenvalues of the components of such an angular momen- 
tum coidd be integers or half odd integers. Moreover, could have a 
single eigenvalue c-orrcsponding to a single value of j, since commutes 
with all three components of M and there is no reason why it should not 
also commute with r and p [since Eq. (24.1) is not valid in this case]. 
Thus could commute with all the dynamical variables that describe 
the particle, and so could be a constant of the motion under all circum- 
stances; there would then be no objection to equating to a definite 
number + 1 )h^. This is not possible for an orbital angular momen- 
tum, since in that case M- docs not commute with r and p, and hence is 
not always a constant. 

An intrinsic*, angular momentum of the type described in the last 
paragraph is called a npin angular momcnhim. 4t is found experimentally 
that electrons, protons, and neutrons each possess a spin angular momen- 
tum for which j and the M matrices are given in (24.15). There is 
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also some reason to believe that particles, called mesons or mesotrons^ 
exist that have masses intermediate between the electron and proton 
masses and spin angular momenta for which j is 0 , or 1 . 

Addition of Angular Momenta. It is sometimes of interest to con- 
sider the vector sum M = Mi + M2 of two angular momenta Mi and M2 
that commute with each other [all components of Mi commute with all 
components of M2, and Mi and M2 separately satisfy the commutation 
relations ( 24 . 2 )]. These angular momenta could refer to independent 
particles, or to the noninteracting spin and orbital angular momenta of 
the same particle. A representation in which Mf, M|, Mu, and M^z are 
diagonal is easily obtained from the foregoing theory. The rows and 
columns are labeled hyji, j^, ^1, and m^, the matrices for Mi, for example, 
have the form ( 24 . 15 ) so far as the indices ji and mi are concerned, and 
are unit matrices as regards the indices and m2. 

It is possible to find a second representation in which Mf, M|, MS 
and Mz are diagonal, for which the rows and columns are labeled by 
3 u 3 h 3 ) ^ [i(i + 1 )^^ is an eigenvalue of M^ and mh is an eigenvalue 

of M2]. If ji and j2 are fixed, the first representation has 

(2^1 + l)(2j2 + 1) 

rows or columns, and can be specified by products of eigenfunctions of 
M12 and of M22, just as the representation ( 22 . 6 ), in which the Hamil- 
tonian is diagonal, is specified by eigenfunctions Uk of the operator H. 
The second representation, with the same fixed values of ji and must 
have the same number of rows, since it is specified by eigenfunctions of 
M^ and Mz that are linear combinations of the original eigenfunctions. 
We now find the values of j and m that appear in the second representation. 

Eigenvalues of (Mi + M2y. Since M, = M12 + M2^, it is apparent 
that the possible values of m are mi + m2. The largest value of m is 
therefore ji + j2, and this occurs only once, when mi = ji and m2 = J2. 
This shows that the largest value of j is ji + j2, and that there is only one 
such state. The next largest value of m is ji j2 — 1 , and this occurs 
twice: when mi = ji and m2 = j2 — 1 , and when mi = ji — 1 and 
m2 = j2 (provided that neither ji nor ^2 is zero). One of the two linearly 
independent combinations of these two states must be associated with 
the new state for which j = ji + ^*2, since for that j value there must be m 
values ranging from ji -f 22 to — Ji — 22 by integer steps. The other 
combination cannot be associated with this or a larger j, since the larger 
m values that should then also be present actually are not. Therefore 
the second combination is associated with j = ji + ^2 — 1 . By an 
extension of this argument we can see that each j value, from ji + is 
down to \ji — j2\ by integer steps, appears just once, and that with each 
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is associated 2j + 1 linearly independent combinations of the original 
eigenfunctions. Thus the number of rows or columns of the second 
representation is 

ii+ia 

^ (2j + 1) = (2ii + l)(2j2 + 1) 

3 = \n-h\ 

as expected. 

The foregoing result is the same as the addition rule for angular 
momenta of the old quantum theory: the magnitude of the sum of two 
angular-momentum vectors can vary from the sum of their magnitudes 
(parallel case) to the difference of their magnitudes (antiparallel case) by 
integer steps. 

The unitary matrix that transforms from the mi, m 2 to the j,m repre- 
sentation for fixed values of ji and ^2 can be found by matrix methods. 
Since its structure is rather complicated it is not quoted here.^ 

Problems 

1. Assume that any Hermitian matrix can be diagonalized by a unitary matrix. 
From this, show that the necessary and sufficient condition that two Hermitian 
matrices can be diagonalized l)y the same unitary transformation is that they 
commute. 

2. Given two matrices A and B that satisfy the following equations: 

A 2 - O, AA* + A*A - 1, B ^ A*A 

where O is the null matrix and 1 is the unit matrix. Show that = B. Find A and 
B in a representation in which B is diagonal, assuming that it is non degen orate. Can 
A be diagonalizcal in any represcnitation? 

3. Given threes matrices A, B, and C that satisfy the following equations: 

A 2 = /^2 =2 aB + BA = BC + CB = CA + AC = O 

where 1 is the unit, mjitrix and O is the null matrix. Find all three matih^es in a 
representation in which A is dingonal, assuming that it is nondegencrate. 

4. Given thr(H> matrices A, B, and C that satisfy the following equations: 

A'^ = B 2 =» = 1 , BC - CB = lA 

where 1 is th<^ unit matrix. Show that A B -f- BA = AC + CA — O, where O is the 
null matrix. Find all three mat,ric(‘S in a nqiresentation in whi<h A is diagonal, 
assuming that it is nondegcmcrjib^. 

5. Mak(^ us(‘ of th(‘. matrix expression (13.18) for x in a representation defined by 
the harmonic-oscillator wave func-tions to obta,in a sirnihir matrix for using purely 
matrix methods, (hinparc with the answer to Prob. 3, Chaj). IV. 

6 . Use purely matrix methods to show that if .c„w 7 ^ 0 for a harmonic oscillator, 

thenB„ — = ±fi(K/n)K Note that for a harmonic oscillator, H = p^/2n + 

xp — px — ifi. 

1 See E. II. Condon and G. IT. Shortloy, ‘‘The Theory of Atomic Spectra,” Chap. 
Ill, Sec. 14 (Macmillan, New York, H135); Chap. TIT also discusses other interesting 
properties of angular momentum. 
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7. Show by purely matrix methods that if H - pV^ju -1- 7(r), 


i 


(En — Em)\Xnm\- = 


2fj. 


where the summation is over all states and a; is a cartesian component of r 

8. If J? = p^/2/i + V {x) for a one-dimensional particle, and V (x) car 
as a power series in x, show by purely matrix mathods that 


^ P 

dt ix 


dt dx 


What is meant by the operator d/dt in this connection? 

9. Transform the Hamiltonian for the harmonic oscillator expres 
representation 

into the p representation. What are the solutions in the p repreacntatio 
spond to the x representation wave functions (13.13)? 

10. A{x,p) and B{x,p) can be expressed as power series in x and p, an 
Show that 

11. Verify Eq. (24.17) by direct operation on the sph<n*i(ral harmonic 
operator given in Eq. (24.16). 



CHAPTER VII 

APPROXIMATION METHODS FOR STATIONARY PROBLEMS 


In quantum mechanics, as in classical mechanics, there are relatively 
few systems of physical interest for which the equations of motion are 
capable of exact solution. Approximation methods are expected to play 
an important part in virtually all applications of the theory. This 
enhances rather than diminishes the importance of those problems for 
which exact solutions can be found, since as was pointed out at the begin- 
ning of Chaps. IV and V, exact solutions are often useful as starting points 
for approximate calculations. In addition, they may also help to estab- 
lish limits of validity for various approximation methods. 

In this chapter and the next we develop several approximation 
methods and apply them to illustrative problems. It is convenient to 
divide the methods int o two groups according as they deal with stationary 
states of systems tluit arc represented by eigenfunctions of the energy 
(considered in ibis ebapixu'), or with problems in which the Hamiltonian 
depends on ibo time (eonsidorod in Chap. VIII). In both cases we start 
with the Sclirdtlinger wave ecpiation, and only occasionally introduce 
matrix methods or notation. 

26. STATIONARY PERTURBATION THEORY 

The stationary peiiurbation theory^ is concerned with finding the 
changes in the dis(u-(‘t-c ciuu'gy levels and eigenfunctions of a system when 
a small disturhanc-e is api)lied. It is assumed from the outset that the 
Hamiltonian // in Ibe Schrrxlinger wave equation can be written as the 
sum of two pai-ts. ( )iie of these parts Ho is of sufficiently simple structure 
so that its S(dir()(rmger ociuatioii can be solved, and the other part H' is 
small enough so that it can he regarded as a perturbation on Hq. It is 
convenient to retain oui* old symbols rin and En for the supposedly known 
normalized (‘ig(uiriitu^tions and eigenvalues of the unperturbed Hamilto- 
nian //o, and use \p and W for the perturbed stationary wave function and 
energy level : 

JI^ = H = Ho + H\ Hoiu = E^Un (25.1) 

Nondegenerate Case. The assumption that IE is small suggests that 
we expand the [)ertui‘bed eigenfunction and eigenvalue as power series in 

^ E. Schrodingcr, Awn. d. Phydk, 80, 437 (1926). 
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H' This is most conveniently accomplished in terms of a parai 
such that the zero, first, etc., powers of X correspond to the zei 
etc. orders of the perturbation calculation. We replace « by X 
express ^ and W as power series in X. We assume that these tw 
are analytic for X between zero and one, although this has not bee 
tigated except for a few simple problems.^ The differe^ orden 
perturbation approximation are then given by the coefficients 
responding powers of X. In the final results, X is set equal to 1 . 

The perturbed wave function and energy level are written 

ij/ = Ip 0 Xi/'i + XV 2 “b X®i/'3 + • • ■ 

W = W, + \Wi + XW2 + XW3 + • • ■ 

and are substituted into the wave equation to give 

{Ho + \H')iiPo + \pi + ■ ■ ■) 

= (Wo + XWi + ■ • ■)('!'» 4- Xv^i + • - •) 

Since Eq. (25.3) is supposed to be valid for a continuous range 0 
can equate the coefficients of equal pow'ers of X on both sides to 
a series of equations that represent successively higher orders 
perturbation. 

HqxI/q = 

Hoipi + = W{)\pi + W i\l^o 

Oto. 

The first of Eqs, (25.4) means that i/'o is any one of the unper 
eigenfunctions, as expected. We therefore put 

i/'o = Uvi, Wo = E-ni 

This state Um is assumed to be nondegencratc, although others 
unperturbed eigenfunctions may be degenerate. 4'he (aiso in whi 
unperturbed state i/'o is degenerate is considorcMl laU'i* in this s( 
First-order Perturbation. It is implicit in tlu^ pr('S(mt treatmer 
the unperturbed state Um is one of a discrete set, (c^vini though the 
set of u's may be partly continuous), since otlun’u isc^ no int.erest 
attach to the calculation of the perturbed onc^rgy. In (he next s 
we consider the perturbation of one of a continuous set of eigenfunc 
in connection with collision problems. 

We expand i/'i in terms of the iin 

4'! = 

^ For a discussion of this point, see N. Arley and V. Borclisonius, Acta Ma 
261 (1945), especially Part IV. 



Sec. 25] APPROXIMATION MET HODS FOR STATION ARY PROBLEMS 151 


where S denotes a summation over the discrete set together with an 
integration over the continuous set of eigenfunctions. Substitution of 
(26.6) into the second of Eqs. (26.4) gives 

^a^^'^HoUn + H'Uni = + W lUyn 

We replace Hi^Un by Eniin in the first term, multiply by Uk, and integrate 
over all space, making use of the orthonormality of the w's:^ 

a]k\Etn ” -Ey.) + Wldkm = jUfcH'UnidT ^ H'km (26.7) 

where the integral on the right is the km matrix element of the perturba- 
tion energy H' in the I’epresentation in which the unperturbed Hamilto- 
nian 27o is diagonal [see Eq. (22.6)]. 

If we take /r = m in Eq. (25.7), we see that 

(25.8) 

which is the expectation value of H' for the state m. Equation (25.7) 
also gives 

a<il> = ^ 

We thus have a solution to first order in W, except that is still unde- 
termined; it is obt ained below from the normalization of \l/. 

Second-order Perturbation. The solutions to second order in are 
found from the third of JOqs. (25.4) by substituting in 

^2 = (25.10) 

to give 

Sail'll 


As before, wo l•(^I)lil(•o //<!«„ by E„Un in ilie first term, multiply by w* and 
integrate over all space f.o obtain 

- E,) = (25.11) 


Jf now we set k = w, we see willi the help of (25.8) that 

iir ^ ^ 

yy 2 — o 7i mil ~~ o o " ly __ u 

E,n — Dn rJm 


(25.12) 


where the prinu^ on S denotc^s the omission of t.lu^ t(M*in n = m from the 
summation and int(^gral.ion over n.. Similarly, if k 9 ^ m in h]q. (25.11), 

^ j'iikffndT is (‘(luiil U) dkn ii* k or n is oiu^ of ii (lis<M’(‘t,(i sot, juitl is equal to 

5(/c — n) if hotli Ix^lon^; to a (xintiiuious S(‘t; in oitlxM* ciiso, ^nSninktUtdr — fk (see Soo. 
10 ). 
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we obtain 

jj f u f 
km^ mm 

(E^ - 

+ (25.13) 

We thus have a solution to second order in H', except that like 
is not yet determined. 

Nonnalization of Since xpo is chosen to be equal to Um, i is already 
normalized to zero order. If then the normalization integral j \^\‘^dr is 
set equal to unity for all orders of X, when \p is given by (25.2), we 
obtain 

J (i/'o^'i + = 0 in first order 

J(i/'o ^'2 + + l^il^)dr = 0 in second order 

These give at once 

+ a«' = 0, + o® + = 0 


7 ( 2 ) 


= S' 




kv,f±nrn 


{Em - Ek){Em - Er.) 


The real parts of and are fixed by these relations, but their 
imaginary parts are not. The choice of the imaginary parts of these 
coefficients is equivalent to the choice of a new phase for \l/ in each order 
of the calculation; this in turn affects the phase of the next higher order 
term in yp. There is no loss of generality involved in making the simple 
choice of zero for these imaginary parts, in which case 

a'i.’ = 0, a® = 


It should be noted that the perturbed energies are independent of this 
choice. 

The energy and wave function to second order in //' are then (setting 
X = 1) 


W = + SI 


En 


+ Si + Si { 


IT f TJ t 

kn-^ nm 


iE„. - E,){E,„ - En) 


(25.1-4) 


HLH„ 


{E„ - Eky\ 


Uk - i 


\HL[^ 

[En, - Ek) 


■) ^(‘n 


It is interesting to note that the calculation of W to a given order in H' 
requires knowledge of yf/ only to the next lower order. 

Perturbation of an Oscillator. As a simple example of the application 
of the first- and second-order perturbation theory to a nondegenerate 
state, we consider the perturbation of the mth energy level of the linear 
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harmonic oscillator of Sec. 13 by an additional energy H' = ^hx^. The 
unperturbed Hamiltonian is Ho = py2n + (the mass is denoted by 
ji to avoid confusion with the quantum number m); the unperturbed 
eigenfunctions u^{x) given by Eq. (13.13) correspond to the eigenvalues 

where m = 0, 1, 2, ... . This example is 
evidently a trivial one since the perturbed eigenfunctions and eigenvalues 
are given simply by replacing by fC + 6 in u^(a:) and it is neverthe- 
less instructive. 

We require the matrix elements of x^ between various pairs of har- 
monic-oscillator wave functions. These may be obtained with the help 
of the generating function (13.10) for the Hermite polynomials, as in 
Prob. 3 of Chap. IV, or more simply by matrix multiplication using the 
definition (21.3) and the expressions for Xnw. given in Eq. (13.18). We 
readily obtain 

(2a-)“‘[(m -|- l)(m + 2)]^, n = m -f 2 

(2a-)“‘(2m 4-1), n — m 

{2a~)-^[m,{m — l)]i, n = m — 2 (25.15) 

0, otherwise 








where a = {fj.K/h-)K Substitution into the first of Eqs. (25.14) then 
gives for the energy to second order 


W = (m + l-)h 



62 1 
SK^ 


in agreement with th(^ expansion of (m + ^)h[{K + 6)/m]^ to second order 
in b. 

Degenerate Case. We now show that the foregoing treatment is 
incomplete when the unperturbed state m is degenerate. We suppose 
first that there is a state k that is degenerate with and orthogonal to the 
state m {Ej, = E,n, iuknjr = 0). Then Eq. (25.7) tells us that = 0. 
Thu.s the first-order perturbation theory developed above is satisfactory 
only if actually is zero. 

If now ///„i = 0 and we suppose in addition that Eq. 

(25. 11) tells us that 

S = S = 0 

n n 9^k,tn 

In this case the degeneracy between the states A* and m is not removed in 
first order, since the first-order perturbed energies for the two states 
{Ek + ////, and Em + are equal. Then the second-order perturba- 

tion theory developed above is satisfactory in general only if there exist 
no states n that connect the states h and m through the perturbation, 
i.e . , only if cither or both of and are zero for all n. 



154 


QUANTUM MECHANICS 


[Chap. VII 


We can summarize the situation by saying that the foregoing pertur- 
bation theory fails in first order if there is degeneracy of the unperturbed 
state in zero order and the perturbation energy H' connects the degenerate 
states in first order. Similarly, the theory fails in second order if there is 
degeneracy of the unperturbed state in first order and the perturbation 
energy connects the degenerate states in second order (through one or 
more other states). 

Removal of Degeneracy in First Order. Let us suppose that the 
perturbation removes the degeneracy of the unperturbed state m in some 
order. This means that there are at least two exact eigenfunctions of the 
Hamiltonian if = Ho + XH' that have eigenvalues that are different 
when X 0 and become equal when X is made to vanish. Now we 
assumed earlier that the eigenfunctions are continuous analytic functions 
of X as X 0; thus each of the eigenfunctions that are nondegenerate 
when X 5 *^ 0, approaches a definite linear combination of the degenerate 
unperturbed eigenfunctions when X = 0. If these linear combinations 
are not the same as the unperturbed eigenfunctions on which the calcula- 
tion is based, the expansions (25.2) are not valid for X = 0, and the 
method developed above breaks down. 

It now becomes clear that we may treat degenerate unperturbed 
states by perturbation theory, if we first perform an exact diagonalization 
of as much of the perturbation matrix H’^i as is necessary to remove the 
degeneracy. This is equivalent to finding the linear combinations of the 
unperturbed eigenfunctions that fit on continuously to the exact per- 
turbed eigenfunctions when X is made different from zero. For example, 
suppose that only the unperturbed state A* is degenerate with m, and 
Hkm = ^ 0. In this case Ave need only diagonalize the submatrix 

(25.16) 

in order to remove the degeneracy (in this case it is removed in first 
order) and find the correct linear combination of and Uk that can be 
used for higher order perturbation calculations. An explicit example of 
this type of situation is given below. 

Removal of Degeneracy in Second Order. It may, however, happen 
that = 0 and so that the degeneracy is not removed in 

first order. A direct but unnecessarily complicated procedure then con- 
sists in diagonalizing the submatrix of the entire Hamiltonian that 
includes the rows and columns labeled by all subscripts n for which either 
or is different from zero. Rows and columns of this matrix can 
be rearranged to bring together any ones that are desired. For example, 
when there are two such subscripts n and Ij we can diagonalize 
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1 mm 

0 

JJt 
^ mn 

H'ml 

0 

Em + H'mm 

HL 

Hu 

TJI 

nm 


En + H'nn 

H'„1 

. HL 

H'lk 


Ei + Hl 


(25.17) 


to remove the degeneracy in second order. 

A less direct but analytically simpler procedure consists in expanding 
the exact eigenfunctions in powers of X as in (25.2), (25.6), and (25.10).^ 
Because of the degeneracy, however, we now include both and Uk in 
the zero-order term. 


— OitnMm ~I“ CLieUk 5 (^^1^^ 

= hmUni + hkUk + S' 

= Un + S(^^nV + ^“«n2)^2; ^ ^ 

ly^n 

where the prime on S indicates that I mjc. Substitution of the first of 
these into the wave equation (//o + == ITwV'w, where 


Em + XTFSJ^ + 

gives to second order in X 

\amH'um + XakH'uk + S'O^^l^^EiUi + X^af^EiUi + X^ai^^H'ui) 

= {XWg^ + XW^^^){am%n + a,n,) 

+ + ^W^Em + X^al^Wg'>ui) (25.18) 

We now multiply (25.18) through on the left by u,n and integrate to 
obtain 

Xa.,//:_ + = XW^Ji^am + (25.19) 

since 7/'^/^ = 0. Similarly, Iilq. (25.18) can be multiplied by Uk and by 
Un {n 7 ^ 'fn,k), and integrated in each case to give 

Xoa/Z'm + = XWg^a, + (25.20) 

Xa„.W;„. + + Xa^^Uin + X*a®/^n + 


= XaiJ'/i’™ + X2a«>£:,„ + X^a^'T^Si' (25.21) 
The first-order terms in (25.19) and (25.20) give the expected result 
If(i) = IN = 11' 

^'711 ^ * nivk ■“ * kk 


‘ J. H. Van Vl(!«k, Phys. Rev., 33, 407 (102<)). S<mi. 4. 
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S'aPH'^ = 


[Chap. VII 


(26.22) 


The first-order terms in (25.21) give an expression for where I = n 9 ^ 

m,k: 

a9-K^m — El) = + akH’i,, (25.23) 

Substitution of (25.23) into (25.22) gives a pair of homogeneous algebraic 
equations for and a*. These equations have a nonvanishing solution 
and only if the determinant of the coefficients of a« and ak is zero [see 
the discussion of Eq. (21.19)]: 


C/ _ 

^ E„,- El 

^ E^- El 




S' 


Em- El 

HI, Hi 


^ kl^ Ik 

Em — El 


- 


= 0 


(25.24) 


The secular equation (25.24) is of second order in this case, while the 
similar equation resulting from (25.17) is of fourth order. It is apparent 

diagon^izltffin ®xact 

Equation (25.24) would also have been obtained if we had started 

MrT2) ^‘^ots of (25.24) are 

hh ^ solutions of (25.22) are am,a„ and 

bm,b. We thus obtain perturbed energy levels in which the degeneracy 
has been removed in second order, and also find the correct linear combi^ 
ations of the unperturbed degenerate wave functions and w* 

First-order Stark Effect in Hydrogen As -in Pvo,^r^i 

j„_ .1 « j. j 1 “ “yurogen. As an example, we now con- 

sider the first-order change in the energy levels of a hydrogen atom due 

E directed along the positive z axis 

TJ T72 

«' tte -uoleu. 

and electron due to the external field and is readily shown to be 


H' = — eEz = -eEr cos d 
where the polar axis is in the direction of positive z. 


(25.25) 


The discussion of Sec. 14 showed that the wave functions for any 
spherically symmetric potential energy, when expressed in spherical 
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harmonics, have even parity when the azimuthal quantum number I is 
even, and odd parity when I is odd. Now the perturbation (25.25) is 
an odd operator, since it changes sign when the coordinates are reflected 
through the origin. Thus the only matrix elements of H' that fail to 
vanish are those for unperturbed states that have opposite parities; in 
particular, all diagonal elements of for the unperturbed hydrogen- 
atom wave functions given in (16.24) are zero. This shows that a non- 
degenerate state, such as the ground state (n = 1) of hydrogen, has no 
first-order Stark effect. 

The first excited state (n = 2) of hydrogen is fourfold degenerate; the 
quantum numbers I and m have the values (0,0), (1,0), (1,1), (I, — !). 
We now show quite generally that nonvanishing off-diagonal matrix 
elements of i?' exist only for states that have the same quantum number 
m. It is apparent from (23.16) that z commutes with the z component of 
angular momentum = xpy — ypa:^ so that [Mz,H] = 0. The ks 
matrix element of this commutator in a representation in which Mg is 
diagonal is {mh — = 0, so that = 0 unless mk = m^. Thus 

only the first two of the above four degenerate unperturbed states need 
be considered in calculating the first-order Stark effect of the first excited 
state of hydrogen. 

Perturbed Energy Levels. The perturbation submatrix that has to be 
diagonalized is of the form (25.16), where = Hkk = 0, and 

= —e£ J ^ 2 io(r)r cos B i^ 2 oo(r) dr 

= — f [ r^(2 ^ dwdr 

Ifiao'^Jo J -i \ ao/ 

— — 3cEU'0, 

where w = cos 6, and use has been made of (16.24). We now wish to 
transform this two-row submatrix of H' from the representation in terms 
of U 2 QQ and to a representation in which it is diagonal and has the 
eigenvalues Wi and W 2 . 

We follow the notation of Kqs. (22.3) and (22.5). The wave func- 
tions are Vi = ^2 = ^^210 for the nondiagonal representation, and 

SnVi + Si 2 V 2 y S 21 V 1 + S 22 V 2 for the diagonal representation. Then in 
accordance with the discussion of Eq. (21.19), the eigenvalues of H' are 
the two roots of the determinantal or secular equation 

Hi^ -Wi 3eEao 

= 0, i = 1,2 

^12 Hi,-Wi 3eEao -Wi 
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These are easily found to be = SeEao, = -SeEao. The trans- 
formation matrix S can be found by writing out the matrix equation 


SH' = 17/8, W = 


/iFi 0 \ 

\ 0 wj 


together with the unitary conditions on 8. The result contains arbitrary 
phase factors, which may be chosen equal to zero, in which case 


S = 2-i 


U -1> 


Thus two of the four degenerate states for n = 2 are unaffected by 
the electric field to first order, and the other two form linear combinations 
^ HM 200 + Msio) with extra energy SeEoo in the electric field, and 

2 “i(W 200 — Msio) 

with extra energy -3eEao. This means that the hydrogen atom in this 
unperturbed state behaves as though it has a permanent electric-dipole 
moment of magnitude 3eao, which can be oriented in three different ways: 
one state parallel to the external electric field, one state antiparallel to 
tne neld, and two states with zero component along the field 

Occurrence of Permanent Electric-dipole Moments. As remarked 
a ove, a permanent electnc-dipole moment (energy change proportional 
to E) can appear in hydrogen only when the unperturbed state is degen- 
erate whereas an induced electric-dipole moment (energy change propor- 
lonal to E ) can appear in any state (see Probs. 1 and 12) . We now show 
t at the first conclusion is generally valid for any system that is described 
by a Hamiltonian that is unaffected by reflection of the coordinates of all 
particles through the origin. It follows from the discussion of Eq (23 26) 
that a nondegenerate state of such a system has definite parity (even or 
)• Then since the electnc-dipole-moment operator is odd, its expec- 
ation value is zero. All the interactions between particles thus far 
encountered in physical theory lead to Hamiltonian functions that are 
unchanged by reflection. Since the ground states of all atoms and nuclei 
are very likely to be nondegenerate, ^ it is to be expected that an atom or a 

the qvatpTTi degeneracy due to the orientation of the total angular momentum of 

thesrdeZemt^tr; "^5' electric-dipole moment because all 

snedal svmm^rv 7® the same parity, degeneracy either is associated with some 
spemal symmetry property of the system (such as the separability of the hydrogen- 
atom wave equation m spherical and parabolic coordinates) that is unlikely to oLur 
for many-particle systems, or is accidental and hence very Wobable on statistical 
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nucleus in its ground state will not possess a permanent electric-dipole 
moment, and none has ever been found experimentally. An extension of 
the foregoing argument leads to the expectation that an atom or a nucleus 
can possess electric charge, electric-quadrupole moment, magnetic-dipole 
moment, etc., but not magnetic pole, electric-dipole moment, magnetic- 
quadrupole moment, etc. (see also Prob. 19, Chap. XI). 

26. THE BORN APPROXIMATION 

In the preceding section we considered the perturbation of those 
energy eigenvalues, and the corresponding eigenfunctions, which are 
part of a discrete set. Here we are concerned with the perturbation of 
one of a continuous set of eigenfunctions. As in Chap. V, such eigen- 
functions are of interest in connection with the theory of collisions. The 
object of the calculation is not to determine an energy eigenvalue, which 
in this case can be fixed in advance, but to find the perturbed eigen- 
function and its relation to the scattering cross section. To simplify 
matters, we restrict our attention to those cases in which the entire 
potential energy of interaction between the colliding particles is regarded 
as a perturbation, and carry the calculation only to first order. As we 
shall see, this Born approximation^ is best applied when the kinetic 
energy of the colliding particles is large in comparison with the inter- 
action energy. It therefore supplements the method of partial waves 
(Sec. 19), which is most useful when the bombarding energy is small. 

Perturbation Approximation. We wish to solve the wave equation 
for the relative motion, Eq. (18.8), 

- ^ V% + V{r)u = Eu, n = — (26.1) 
2jli ^ ’ mi + m2 

and obtain an asymptotic form like that given by Eq. (18.10), 

+ r-•/(0,<^)c’^ ^ (2^-2) 

We adopt the perturbation approac^h of Sec. 25, and put 

u{r) = + 2 ^(r), (26.3) 


grounds. Tn somo inolocidi^s, how(^v(M-, Uhm‘o is a group of iioiirly dogoncrate states 
between wliieh niiitrix elenuuits of the eleetrie-dipole operator exist; if those energy 
levels an^ (rloscdy spa(H>d in c.oniparison with eitluu* the thermal energy of the molecule 
or the energy assexuaU'd with the ai)plied elee.trie fudd, they give rise to a permanent 
electri(!-dipole moment [s(Hi J. IT. Van Vle<*.k, "The Tlieory of Kleetric and Magnetic 
Susceptibilities/' p. 154 (footnote 28), Sees. 48, 70 (Oxford, New York, 1932)]. 

^ M. Born, Zeits. f. Physik, 38, 803 (1926). 
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where the scattered wave v(i) is to be a small addition to the unperturbed 
plane wave solution e^‘. The term D(r) will be found only to first order 
in the scattering potential F(r); the Born approximation becomes quite 
arduous when carried to higher orders. 

Substitution of (26.3) into (26.1) gives 

(_ V2 - /r^)v = - U(r)e'^^ - U{t)v, C/(r) = ^ F(r) (26.4) 

Our assumption that v{j:) is small in comparison with or the roughly 
equivalent assumption that U(t) is small in comparison with /r^, leads us 
to neglect the second term on the right side of Eq. (26.4). We therefore 
have to solve the inhomogeneous wave equation 

(-V2 - F)2;(r) = (26.5) 

where the right side is known. A sufficient criterion for the validity of 
our solution is 

b(r)l = 1, for all r (26.6) 

This condition, while always sufficient, is in some cases more stringent 
than is necessary for the Born approximation to provide useful results. 

Green’s Function. Rather than discuss Eq. (26.5) as a special case, 
we indicate a method of solution of the more general inhomogeneous 
partial differential equation 

(Q - coo)Kr) = F{r) (26.7) 

Here is a Hermitian operator that defines a complete orthonormal set 
of eigenfunctions with real eigenvalues co, and F{t) is a known func- 
tion of r. 

Ou<,(r) = coUo,{t) 

fu^'{T)Ufa{r)dr = 8{o) — co') (26.8) 

iuc^{r)uu{T')do} = d{r — r') 

It is assumed for definiteness that the eigenvalues co form a continuous set. 
Equation (26.7) can be solved by expanding ^>(r) in terms of 

v{i) = jAuUo,(i)do) (26.9) 

Substitution into (26.7) gives 

JAu{o} — coo)wo,(r)dco = i^(r) 

If we multiply this equation by Uu'{r) and integrate over r, we obtain 

_ fuAr)FiT)dr 

f 

CO — coo 
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Thus the solution of (26.7) can be written 

v{t) = J(?„.(r,r')F(rOdr' (26.10) 

where the function 

G«(r,r') = [ do, ( 26 . 11 ) 

J CO — COO 

is called the Green ’s function for the operator and the number wo- 
Green’s Function for a Free Particle. The Green’s function (26.11) 
can be evaluated without great difficulty when the operator Q is the 
Hamiltonian for a free particle. A suitably normalized eigenfunction of 
— V“ corresponding to the eigenvalue is from (11.11) 

Mk'(r) = (2:r)~5 exp ik' ■ r 

where k' is any vector of magnitude k'. Thus the Green’s function is 


= (2x) 


'■/ 


exp (ik' • r) exp {—ik' ■ r') 
k'^ - 


(hk' (26.12) 


We perform the k' integration in spherical polar coordinates with the 
polar axis in the direction of the vector p = r r’. 


G.(r,r') = (2x)-» / / / orzra 

. Jo Jo Jo 

. {f 


(26.13) 


where <r = kp — k\x — r'| is a positive number. 

The singularities in the integrand of (26.13) at /c = io* can be dis- 
cussed in terms of the corresponding singularity of the expansion coeffi- 
cient Ao, in (20.9) at co = coo. The behavior of Ao, at co = coo cannot be 
found from Eq. (20,7) alone, since to any solution v{r) of the inhomo- 
geneous eciuation can be added a solution UuX^) of the corresponding 
homogeneous eciuation. The addition that is to be made can be deter- 
mined c^nly from the boundary conditions that are imposed on v(r)- In 
similar fashion, the contribution to the integral in (26.13) from the infini- 
tesimal neighborhood of the points k = ±cr can be determined only from 
comparison of Eqs. (26.3) and (26.2), which shows that we want only 
those solutions v{t) that have the asymptotic form r-l/((9,c^>)c^^'^ From 
the relation (26.10) between v{t) and Gfc(r,r'), we see that we must evalu- 
ate the integral in (26.13) in such a way that it approaches when cr is 
large. 
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Explicit Evaluation. The contribution to this integral from the 
infinitesimal neighborhood of the points /c = ±(7 can be adjusted by 
regarding it as a contour integral in the complex k plane. The main path 
of integration is along the real axis, and we now show that the choice of 
the contour near the two singularities determines the dependence of the 
integral on cr. 

Suppose that we choose the contour as shown in Fig. 21(a). The 
integral in (26.13) can be written 




K6"' 


dK — ( 2 ^) 




kC 


<r)(K + cr) 


dK (26.14) 


Imaginary 

axis 


a: plane 


_ Real 
-Haxis 


O') (/c + (X) 

The first integral can be evaluated by closing the contour with an infinite 

semicircle G in the positive imaginary 
half plane as in Fig. 21(6), since the 
exponential becomes vanishingly small 
there and contributes nothing to the 
integral. Then the value of the first 
integral is 27ri times the residue of the 
integrand at the only pole {k = <r) that 
lies within the contour : 

The second integral in (26.14) can be 
evaluated by closing the contour with an 
infinite semicircle C' in the negative 


Ca) 



imaginary half plane as in Fig 21c. It is 
equal to — 27r^ times the residue at the 
only pole {k = —c) that lies within the 
contour: Thus the entire expres- 
sion (26.14) is equal to It is not 

difficult to see that the choice of any 
contour other than that of Fig. 21(a) 
would have given a term in addition 
to or in place of the term c'’^. Such a term 
in G corresponds to an incoming wave in 
i;(r), and is ruled out because of the 
asymptotic form (26.2) of the desired 
solution. 

Substitution into Eq. (26.13) shows that the Green’s function for a 
free particle (represented by the operator — V‘-^) is 


ation of the integral in Eq. (26.13). 


G*(r,r') = (47rlr — r'l)-‘ exp ik\i — x'\ (26.15) 

Scattering Cross Section. From Eqs. (26.3), (26.5), (26.10), and 
(26.15) we obtain an approximate solution of the wave equation (26.1) 



Sec. 26] AJ" P POX I M A TION METHODS FOR ST A TION ARY PROBLEMS 163 

w(r) = - (4n-)-i/lr - r'|-i (26.16) 

The second term in (26.16) has the form of a superposition of waves 
scattered from all points r' with amplitudes proportional to the product 
of the incident wave amplitude and the scattering potential at those 
points- 

We assume that U (r') falls off rapidly enough for large r' so that there 
is an asymptotic region in which r is large in comparison with those values 
of r' for which there is a significant contribution to the integrand. We 
can then put 

Ir-r'l >r-wr', (r - r'^ + ^ 

whero w is the cosine of the angle between the vectors r and r'. Thus the 
asymptotic form of (26.16) is 

'u{r) (26.17) 

r—A 00 

Comparison of (26.2) and (26.17) shows that the scattered amplitude 
is 

= - (47r)-ijC7(r') exp iK • r'dr' (26.18) 

Here we define a vector K = ko — k, where ko is the vector of magnitude 
k that has the direction of the incident beam (polar axis), and k is the 
vector of magnit.udc k that has the 
direction of ibe polar angles (0,0) of 
the point at which the scad-tcred ampli- 
tude is measured. Figure 22 shows 
these three vectors; tlie magnitude of 
K is evidc^ntly 2k sin Id. Its physical 
significaiKa^ is that //K is the momentum 
transferred from ibe incident particle 
to the scattei’ing j^otcntial during the 
collision. Thus il a Fourier integral 
analysis of ibe sc.att ering potential into 
harmonic, space wav(‘s is made, it is 
apparent from (be scu^ond integral in (26.18) that the scattered amplitude 
in a ptirfic.uliu- (iiivc-fioii is proportional to the Fourier component of the 
scattering ])()(, cnt.iiil that, corresponds to the momentum change of the 
particle dni'ing the collision. 

The di(Tor( 5 iitial scaltering cross section is given by Eq. (18.11): 

cr(0,<f.) = |/(e.<^)r^ (26.19) 



gation vectors ko for the incident par- 
ticle, k for the scattered particle, and 
the angle of scattering d. The momen- 
tum transfer in the collision is ^K, 
which has the magnitude 2fik sin \d. 



quantum mechanics [Chap. VII 

In the event that U{i) = C/(r) is spherically symmetric, the integrals 
over the polar angles of r' in (26.18) can be evaluated by taking the direc- 
lion of K as the polar axis. Then 


fid) = —K ^ r' sin Kr' Uir')dr' (26.20) 

As expected, the scattered amplitude is independent of the angle <}> It 
^ interesting to note from (26.19) and (26.20) that the amount of scatter- 
ing depends on the bombarding speed v and the scattering angle 6 only 
through the combination K o: v sin ^6, 

Perturbation Treatment of Partial Waves. When 17 (r) is spherically 
symmetric, the wave equation (26.1) can be separated in spherical coordi- 
nates, as was done in Sec. 19, and solved approximately by a perturbation 
method. The radial wave equation for the Zth partial wave is 


j. ^ 

dr 



1(1 + 1 ) 



(26.21) 


^ ** ■ »l'ereA(iT) given by 

Eq. (15.5) IS the unperturbed solution. The approximate equation for 
Xi IS found from (26.21) to be 


dr 



l(l + 1) 

r'i 


Xi = U{r)ji(kr) 


(26.22) 


where a term i7(r)xi(r) has been neglected. 

Equation (26.22) is an inhomogeneous total differential equation of 
which the right side is known. The solution is most readily obtained by 
putting xiir) = vi{r)ji(kr), in which case Eq. (26.22) can be written 


= ryf(A-r)f7(r) (26.23) 

Equation (26.23) can be integrated to give 


dv C 

r^Kkr) £ = r^-3likr)Uir)dr + C (26.24) 

where the constant C of integration must be chosen equal to zero since 
dvi/dr IS finite at r = 0. 

The asymptotic form of (26.24) can be obtained if the integral on 
the right converps with infinite upper limit. Because of the asymptotic 
torm (15.£ of j,, this integral converges when the Born approximation 
integral (26.20) converges; i.e., when 17(r) falls off more rapidly than 
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1/r. In this case the asymptotic form of dvifdr is 

sec^ [Ar - ^{1 + l)x] r^jl{kr)Uir)dr (26.25) 
Integration, of (26.25) gives 

viip) > k tan [kr — + 1)t] f “ r^jj(kr)U{r)dr + C' 

1 — > 00 JO 

from which we obtain 

Ri(r) >• (At)~i {cos [kr — ^(Z + l)ir](l + C') 

r—* 00 

+ sin [kr - Ul + l)’r] " kr^jlQcr) U{r)dr} (26.26) 

Phase Shifts. xi was originally assumed to be small in comparison 
with jij so that the constant C' of integration is small compared to unity. 
The phase shift 8i is defined by the asymptotic form 

(/rr)”^ cos [kr — + l)7r + 8i\ 

for Ri given in (19.8). It is also small, and from (26.26) is given approxi- 
mately by 

Si ^ kr^-jf (At) U (r)cZr (26.27) 

Equation (26.27) is the perturbation or Born approximation expres- 
sion for the phase shifts. If all these 6i are actually small, they can be 
substituted into Eq. (19.11) for f{d) with the approximation 

^2/5, _ 1 ^ 2i8i, 

in which case 

00 

f{d) ^ A-‘ ^ (2Z + l)3jPKcos e) 

/. = 0 

00 

= - ( ” rm{r) [ 2 (2Z + l)i?(Ar)Pj(cos 0)] dr (26.28) 

The . summation in brackets can be shown^ to be equal to sin Kr/Kr, 
where K = 2k sin ^6, so that (26.28) is equal to the Born approximation 
amplitude (26.20), as expected. 

The foregoing perturbation treatment of partial waves is of practical 
interest since it has been found in some cases that substitution of the 
approximate phase shifts (26.27) into the exact expression (19.11) for 

^ G. N. Watson, '‘Theory of Bessel Functions,” 2d ed. p. 366 (Macmillan, New 
York, 1945). 
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the scattered amplitude results in an improvement on the Bom approxi- 
mation amplitude (26.20) when the bi are not all small in comparison with 
unity. In general, it is much easier to evaluate the integral in (26.27) 
than to find the phase shift exactly from a solution of the radial wave 



y 

Fig. 23. {a) Angle distribution function for scattering by a square-well potential, as given 

by Eq. (26.29); (6) total cross section function given by Eq. (20.30). 

equation. Also, the Born approximation amplitude (26.20) can be used 
as a device for summing the partial wave series for large I where the h are 
small; suitable modifications can then be made for the low terms of the 
series. 

Scattering by a Square Well Potential. As a first example of the 
application of the Born approximation result (26.20), we consider the 
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scattering by a square well potential; 7(r) = —Vo, r < a, F(r) = 0, 
r > a. Substitution into (26.20) gives 

f{e) = (sin Ka - Ka cos Ka), K = 2k sin 


Thus the differential scattering cross section is 

(26.29) 

The function g(.x)/g{0) = Qgix) is plotted in Fig. 23o. At high energies 
ikay;> 1), the scattering shows a strong maximum in the forward direc- 
tion so that most of the scattered particles are in a cone whose angular 
opening is of order 1/ka. 

The total cross section is most easily evaluated by changing the 
variable of integration from 6 to x = Ka = 2ka sin ^d, in which case 
sin dd6 is replaced by xdx/{ka')'^. Fjquation (26.29) then gives 


(T 


y{y) 




27r 

{kay 


r2ka 

; / g{x)xdx 
Jo 


327rM-V?,a® 


y{2ka) 


1 f (sin X — X cos xy ^ 

^ jo 

J_ _ i 4- A 

4 ?/'\ ?y- y' J 


(26.30) 


Since y{0) = ^ and y{y) approaches l/4?/2 as y becomes large, the total 
cross section is l() 7 rjLi“Koa^/ 9 /i ‘ in the limit ka <3C 1 and becomes 




when the bombarding energy E measured in the center-of-mass system is 
large. function y{y)/y{^) = I^tC?/) i^ plotted in Fig. 236. 

Validity of the Born Approximation. A convenient criterion for the 
validity of the Born approximation as applied to the foregoing problem 
can be set uj) by using lOq. (2().()) and assuming that v{x) is largest at the 
center of the scattcu’ing potential. This condition is probably sufficient, 
but may be more stringent than is actually required; for exam pie, the small- 
angle scattering (small mom on turn transfer) may bo given correctly 
when the large-angle sc-attering is not. From (26.16), our criterion is thaii 
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1^(0) I = l/g j_ ^ e‘*”-(i+»>F(r)rdr<iw 

~ 2h^k^ ~ ~ 

"" Wh - 22/ sin 2 / + 2 - 2 cos y)^ « 1, y = 2ka (26.31) 

_In^ the low-energy limit ika « 1), (26.31) becomes « 1, 

while in the high-energy limit {ka » 1) it becomes 

/tFoa _ 

h^k hv ’ 

where i; is the speed of the incident particle. If the square well potential 
is about “strong” enough to bind a particle (as shown in Sec. 15 this 
requires that yVoa^/h^ « 1), the Born approximation may therefore be 
used for the computation of the scattering only when A-a » 1. Thus the 
Born approximation supplements the method of partial waves (Sec. 19), 
which is most useful when ka is less tha,n or of order unity. 

The qualitative features of the results obtained here and above for a 
square well potential apply to any potential that possesses a well-defined 
range. 

Scattering by a Screened Coulomb Field. As a second example of the 
application of the Born approximation, we consider the elastic scattering 
of an electron by a neutral atom that is represented by a simple form 

of screened Coulomb potential:* F(r) = -(ZeVr)e"i This potential 
energy behaves like the nuclear Coulomb potential for atomic number Z 
when r is small, and falls off rapidly when r is large in comparison with the 
radius” a of the a,tomic electron cloud that screens the nucleus. The 
rhomas-Fermi statistical theory of the atom (see Sec. 38) shows that for 
moderately heavy atoms, a is roughly equal to hP-jme^ZK where m is the 
electron mass. 

Substitution of this potential into (26.20) gives 

2mZe^ f ” . _ , 2mZe^ 

= ~¥ir jo = ¥( K^ + a-^y ^ (26.32) 

* more detailed discussion of this problem, see N. F. Mott and H. S. W. 

Massey, 'The Theory of Atomic Collisions,” Chap. IX, Secs. 3, 4 (Oxford, New York, 
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This gives a cross section in agreement with the Rutherford result (20.11) 
when the momentum transfer is large enough so that 1/ can be neglected 
in comparison with in the denominator; in the analogous classical 
situation, the particle passes close to the nucleus, so that the screening 
electrons are relatively ineffective. Equation (26.32), unlike the Ruther- 
ford result, yields a finite cross section at vanishingly small angles; the 
analogous classical particles pass far from the nucleus and are well 
screened from it by the atomic electrons. The total cross section is 


O’ 



2TrKdK 


F(4WT~T) 


With the above Thomas-Fermi expression for a, this becomes 4:TrZ^/k^ 
at high energies (A’a » 1), which agrees in order of magnitude with the 
result of a numerical integration^ of the scattering produced by the 
Thomas-Fermi potential. 

The criterion (26.31) for the validity of the Born approximation 
becomes 


2mZe‘^ 

~¥k" 


I 


ix-rr dx\ 
sin X e — 

.T 


« 1 


where x replaces kr as the variable of integration. For /ra <$C 1, this 
becomes 2mZcMlh:^ « 1, which with the above approximate expression 
for a is eciuivalent to Z'‘^ « 1 ; thus the Born approximation cannot be 
used for the scattering of slow electrons by atoms. For A'*a» 1, the 
criterion becomes {Zc^Jhv) In ka <$C 1. Since it turns out that this result 
is substantially unaffected by relativity theory the approximation 
becomes poor for the heavier elements, where 


he 137 


becomes comparable with unity. 

It is interesting to note the close correspondence between the various 
results for the scpuire well potential and tor the screened (Coulomb field 
Avhen a is chosen the same in the two cases and Vo ~ Ze^/ a. 


27. THE VARIATION METHOD 

The variation method has been used mainly for the approximate 
determination of the lowest or ground-state energy level of a system.^ 
It can be used when there is no closely related problem that is capable of 

^ E. C. liiillard and H. S. W. Massey, Proe. Canib, Phil. Soc., 26, 556 (1930). 

® It has also been applied recently to collision problems by J. Schwinger, Phys. 
Rev., 72, 742 (1947). 
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exact solution, so that the perturbation method is inapplicable. The 
variation method can also be applied to systems that are described by a 
nonseparable Schrodinger equation, in which case numerical solutions are 
extremely arduous and the WKB method (Sec. 28) cannot be used. 

Expectation Value of the Energy. It was shown in Sec. 10 that if an 
arbitrary normalized function \f/ is expanded in energy eigenfunctions 

\p = ) AeUb, where Hue = Eue (27.1) 

E 

and the Ue form a complete orthonormal set, then the expectation value of 
H iov the function is given by 

(H) = j mdr = ^ E\Ae\^ (27.2) 

B 


where the integration is extended over the entire range of all the coordi- 
nates of the system. It is assumed for convenience in Eqs. (27.1) and 

(27.2) that the energy eigenvalues are all discrete; this can be accom- 
plished by enclosing the system in a box (Sec. 10), or else the summation 
can be replaced by the symbol $ (Sec. 22). 

A useful inequality can be derived from Eq. (27.2) by replacing each 
eigenvalue E in the summation on the right side by the lowest eigenvalue 
Eq'. 

{H) S ^ ^ \A4^ (27.3) 

E E 

Since ^ = 1 for a normalized function \l/, as was shown in Sec. 10, 

E 

(27.3) yields the inequality 

Eq ^ (27.4) 


In the event that is not normalized, (27.4) evidently can be rewritten as 


Eq 


= mdT 


(27.5) 


The variation method^ consists in evaluating the integrals on the 
j-ight side of (27.4) or (27.5) with a trial Junction \p that depends on a 
number of parameters, and varying these parameters until the expecta- 
tion value of the energy is a minimum. The result is an upper limit for 
the ground-state energy of the system, which is likely to be close if the 

^ The method was originally appluul l)y Lord Rayleigh in 1873 to the computa- 
tion of the vibration frequencies of mechanical systems — “Theory of Sound,'’ 2d rev. 
ed., vol. 1, Sec. 88 (Macmillan, London, 1937; reprinted by Dover, New York). 
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form of the trial function resembles that of the eigenfunction (see Prob. 9). 
Thus it is important to make use of any available information or physical 
intuition in choosing the trial function.^ 

Application to Excited States. The variation method can also be 
used to obtain an upper limit for one of the higher energy levels if the 
trial function is orthogonal to the eigenfunctions of all the lower states. 
Suppose that the energy levels are arranged in an ascending series: 
So, Eij ^ 2 , . . . - Then if rp is orthogonal to for i = 0, 1, . . . , n, 
it is easily seen from (27.1) that the corresponding expansion coefficients 
Asi are all zero. An inequality can be derived from (27.2) by replacing 
each eigenvalue E in the summation on the right by En+i, with the 
result that the expectation value of the energy is an upper limit on this 
eigenvalue. 

The trial function p — UEjuB^^dr is evidently orthogonal to 'Wjeq, so 
that if the lowest eigenfunction is known either from an exact solution or 
to a sufficiently good approximation from a variation calculation, an 
upper limit for the energy of the first excited state can be computed. 
Trial functions that are orthogonal to any number of known eigenfunc- 
tions are easily found in this way. 

It is sometimes possible to divide the energy eigenfunctions into 
groups such that any member of one group is orthogonal to any member 
of any other group. Suppose that there is a Ilermitian operator F that 
commutes with H {FII — HF = 0); then from a theorem of Sec. 21, F 
and H can be diagonalized simultaneously and have common eigenfunc- 
tions. Now any two eigenfunctions of F that correspond to dillerent 
eigenvalues are orthogonal to each other.- Thus a trial function that is 
constructed entirely from eigeni unctions oi F that correspond to a given 
eigenvalue is orthogonal to all other eigenfunctions that correspond to 
different eigenvalues of Fj and will provide an upper limit for the lowest 
energy eigenvalue that is associated with this eigenvalue of F. The fore- 
going results are usciful when the ()[)ern-tor F is one whose eigenfunctions 
are easily recognizable by some simple property, sindi as, for example, the 
symmetry in case F is the angular moiiKMitum or the parity, then a 
trial function with angular (lep(ui(hMT(^e corresponding to a particular 
angular momentum, or with a pariiic-ular parity, can easily be written 
down, and gives an upper limit for the lowest energy level that has this 
angular momentum or parity. 


^ For an extension of this method th«it gives both upper and lower limits, see D. H. 
Weinstein, Proc. Nat. Arad. SrL, 20, 520 (1034). 

^ This is shown explie/itly in thp (10.4) lor tlu’! (Miergy opcTJit^or, and the proof givcm 
there is easily cxtend(Ml to any Hennitian operator. 
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Ground State of Helium. As a first example, we use the variation 
method with a simple trial function to obtain an upper limit for the energy 
of the ground state of the helium atom. The helium atom consists of a 
nucleus of charge +2e surrounded by two electrons; from Eq. (16.1) 
we find that its Hamiltonian is (neglecting the motion of the nucleus) 




where ri and are the position vectors of the two electrons with respect 
to the nucleus as origin, and ri 2 = |ri — 12 ! is the distance between the 
two electrons. 

If the interaction energy e^/ri 2 between the two electrons were not 
present, the ground-state eigenfunction of H would be the product of 
two normalized hydrogenic wave functions uioq(ti)uiqo(t 2 ) given in Eq. 
(16.24): 


^(ri,r2) 


-('^')(n+r2) 

— 5 e 
Trag 


(27.7) 


with Z = 2. We shall use (27.7) as a trial function, and permit Z to be 
the variation parameter so that it is not necessarily equal to 2. 

It follows from Prob. 13, Chap. IV, that the expectation values of the 
kinetic and potential energies for the ground state of a hydrogen atom are 
e^/2aQ and respectively; the corresponding hydrogen wave func- 

r 

tion is (Trag)” h The expectation value of either of the kinetic energy 
operators in (27.6) for the function (27.7) is obtained most easily by not- 
ing that operation with the Laplacian gives a result that is inversely 
proportional to the square of the length scale of the wave function; since 
the scale of (27.7) is smaller than that of the hydrogen wave function by a 
factor of Zj the expectation value of each of the kinetic energy operators 
is e^Z^/2aQ. Similarly, the factors 1/r make the expectation values of the 
nuclear potential energy operators inversely proportional to the length 
scale; there is also an additional factor 2 from the nuclear charge, so that 
each one is —2e'^Zfa^, 

Electron Interaction Energy. The expectation value of the inter- 
action energy between the electrons is 

j j ^'(ri,r2) ^(ri,r2)dTidr2 
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This integral is most easily evaluated by regarding it as the mutual elec- 
trostatic energy of two overlapping, spherically symmetric, charge dis- 
tributions, in which case simplifications from the theory of electrostatics 
can be introduced. 

A more general way of performing the integration, which can also be 
used for wave functions that are not spherically symmetric, consists in 
expanding l/ri 2 in spherical harmonics. 


Tn 


«0 



Pi(cos 6), 



> Ti 


ri < r-2 


(27.9) 


where 8 is the angle between ti and ra, cos d = cos Si cos 82 + sin 61 ■ 
sin 62 cos {(pi — ^ 2 ), and 6 i, 4 >i and 02, <#>2 ai-e the polar angles of the vectors 
Ti and Ta, respectively. ‘ It can be shown'-* that 
Pj(cos 8 ) = Pj(cos 0i)P!(cos 82 ) 

I 

+ 2 ^ Pr(cos 0 i)Pr(cos 02 ) cosm{<t>i - ^ 2 ) (27.10) 

m = 1 

When (27.9) and (27.10) are substituted into (27.8) and use is made 
of the orthogonality of the spherical harmonics, the integration over the 
polar angles of ri causes all terms to vanish except that for which I and 
m are zero. The integral on the right side of (27.8) becomes 



2Z 

au 


(n+rz) 


r|dr2 + 



1 (ri-f-ra) 

— e 

T'l 



r\dri 


which can be evaluated as ^Tr'^al/SZ^. Thus the electron interaction 
energy has the expc^ctation value 5c“Z/8ao. 

Variation of the Parameter Z. We now have the result that the 
expectation value of the Hamiltonian (27.6) for the trial function (27.7) 
is 



4c^Z be^Z 

do ScLq 


= ^ {Z^ - V-Z) 

do 


^Equations (27.9) follow at once from the geiioratinK fun<‘.tion (14.10) for the 
Legendre polynomials; the expression for eos 0 is simply obtained from the scalar 
product of th(^ vectors ti and r-j in rectangular coordinates. 

® E. T. Whittaker and G. N. Watson, Course of Modern Analysis/’ 4th ed., 
p. 328 (Cambridge, Ijondon, 1935). 
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Differentiation with respect to Z shows that this is a minimum when 
2 = = 1.69. Thus the lowest upper limit for the ground-state energy 

of the helium atom obtainable with this trial function is 


= 

Cl/Q 


-2.85- 

CLo 


The experimental value for the energy required to just remove both 
electrons from a helium atom is 2.904e^/ao, so that our limit is about 1.9 
per cent high. The most careful variation calculation of the ground-state 
energy of helium gives a result in excellent agreement with experiment/ 
and provides an important verification of the theory of quantum 
mechanics. 

The result that hydrogenic wave functions give the best energy value 
when Z = rather than 2 indicates that each electron screens the 



Fig. 24. Two hydrogen atoms, with nuclei at A and B separated by a distance R, have 
electrons at 1 and 2; their interaction is given by H' in Eq. (27.11). 

nucleus from the other electron, the effective nuclear charge being reduced 
by Ts- of an electronic charge. 

If the electron interaction term e^/r^ is regarded as a perturbation, 
the first-order perturbation energy is given by {H) with Z = 2 and is 
— 2nhe^la^, which is 5.3 per cent above the experimental value. It 
is apparent . that, in general, the first-order perturbation calculation is 
equivalent to a nonoptimal variation calculation. 

van der Waals Interaction- As our second example of the application 
of the variation method, we calculate the van der Waals (long-range) 
interaction between two hydrogen atoms in their ground states. It is 
convenient to consider this problem first by means of the perturbation 
theory, since it is then easier to see that the leading term in the energy at 
great separation distances varies inversely as the sixth power of this 
distance. Also, it turns out that the perturbation theory and the varia- 
tion method provide opposite limits for the coefficient of this term. 


^ E. A. Hylloraas, TtciU. f. Physik, 66, 209 (1030). Tho host value obtained by 
Hylleraas is slightly lower than the experimental value, contrary to theoretical expecta- 
tion; the discrepancy must be attributed to a numerical (‘rror in the calculation, to 
experimental error, to some small interaction energy that was not taken into account, 
or to an inadequacy of the theory [see H. A. S. Eriksson, Nafvre, 161, 393 (1048)1. 
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We assume that the nuclei of the two hydrogen atoms, A and B, are 
fixed in space a distance R apart, and that the z axis is chosen parallel to 
the line through A and B. Then if ti is the vector displacement of elec- 
tron 1 from nucleus A and is the vector displacement of electron 2 from 
nucleus B (see Fig. 24), the Hamiltonian for the two electrons can be 
written 


Ho = 
H' = 


H = Ho + H' 

_ " (Vf -f VI) - - 
2m ^ ^ ri 

£! -L ii _ ii _ 

R Tib T2a 


r2 


(27.11) 


The unperturbed Hamiltonian Hq has the solution 
t^o(ri,r2) = 'Wioo(ri)i/ioo(r2) 


for two noninteracting hydrogen atoms in their ground states. We 
regard the interaction terms as a perturbation; this is equivalent to 
assuming that R ao. 

Since we are interested in the leading term in the interaction energy 
when R is large, we expand H' in powers of 1/R and keep the lowest 
terms. 




_ £2 I 
R 1 


1 + 


2(22 — 2i) , (^2 — + (?y2 — y\Y + (g2 — ^ 

i ~r 7i ~ * 


It 


-(i-l' + F.) ‘-('+X+I) 1 


= 7^3 (^1^2 + 2/1Z/2 - 22122) 


(27.12) 


The last term is the interaction energy of two electric dipoles that cor- 
respond to the instantaneous configurations of the two atoms. ^ 

It is apparent at once that the expectation value of the leading term 
in H' for the state Wo(ri,r2) zero, since Un is an even function of ri and 
12 and //' is an odd funct ion of ri and r2 separately. It can also be shown 
that all the neglected higher terms in //' have zero expectation value for 
Wo, since these terms can be expressed as splierical harmonics of order 
different from zero. Thus the leading t(M'm in the interaction energy is 
the second-order perturbation of the dipole-dipole term, which is pro- 
portional to //'- and hence varies like l//iJ^ 


1 The nefj;lc!(5ted terms in the exiianaion (27.12) that vary like l/R^ are the. dipolo- 
quadmpole iiitcractioii; the 1/H^ tonus are th(‘. quadrupohi-quadrupele interaction, 
etc. 
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Perturbation Calculation. From Eq. (25.12), the second-order 
change in the energy of the two hydrogen atoms is 

WiR) = S' 

where the index n refers to all states of the pair of unperturbed hydrogen 
atoms (including dissociated states), and the ground state Uo is excluded 
from the summation and integration that is denoted by If is appar- 
ent that WiR) is negative, since < E^ and the numerator of each term 
in (27.13) is positive. We thus conclude that the interaction is attractive 
and proportional to l/i2®, when R is large; both these conclusions can be 
shown to be valid for any pair of atoms that are in nondegenerate, 
spherically symmetric ground states. 

We can obtain an upper limit on the positive quantity —W{R) by 
replacing each E^ in (27.13) by the energy E^* of the lowest excited state 
of the two hydrogen atoms for which is different from zero. Then 
the denominator can be taken outside of the summation, which can be 
evaluated as a matrix product 

= S'H'A = SH'A - HZ = (J?'“)oo - HZ 

Since we have seen that Hqq = 0, we have that 

The state n* is that in which both atoms are excited to states of princi- 
pal quantum number 2, so that Eq = — 2(cV2ao), En* = — 2(eV8<^o)j 
and En* — J?o = 3e^/4ao. From (27.12) we have 

H'^ = ^ (xlxl + y\y\ + 4zfz| + 2xxXiyxy-i - . . . ) (27.16) 

The expectation value of the cross-product terms like xix^yiVi is zero 
since these terms are odd functions of one of the cartesian components of 
ri or r 2 . The first three terms in the parenthesis of (27.15) are each the 
product of two identical factors that are equal to 

j a;Vioo(r)l2dr = i j ^^|^^ioo(r)|2dT 
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SO that (Jy'^)oo = Substitution into (27.14) gives 


WiR) (27.16) 

Variation Calculation. An upper limit on W{R) can always be 
obtained by the variation method. It is apparent, however, that some 
judgment must be used in the choice of the trial function \p] thus if ^ 
does not depend on R, the dependence of the expectation value of the 
energy on R will be like that of //', that is, An upper limit with 

this R dependence is of no value to us, since what we really want to 
determine is a limit on the coefficient of the 1/R^ interaction. A useful 
choice for ^ will be one in which there is a term proportional to since 
there Avill then be terms in the expectation value that are proportional to 
and hence vary like 1/J2®. 

We choose for the trial function 

\//(xijT2) = Uioo(ri)uioo(r2)(l "h AH') 

where A is to be the variation parameter. Since this \p is not normalized, 
we use (27.5) rather than (27.4) and obtain 

T. . ^ !!Ml + AH')(H, + + AH')dr,dr 2 ... 

E, + W{h) g TJula + AH'Y<i^ir, 

where again ito is the product of the ground-state hydrogen wave func- 
tions, and A is assumed to be real. The right side of (27.17) can be 
written 

(27.18) 

since Uq is a normalized eigenfunction of Hq with the eigenvalue 


Hu 1 

nu 

and Hoq = (//'®)n() == 0. It is easily seen that {H'HnH')uo is a sum of 
squares of factors of the form J7^ioo(r) x Ho z ?>tio()(r)ci(r; this can be shown 
by direct computation to be zero. 

Since we are inter(isted only in terms of order we expand the 
denominator of (27.18): 

[Eo + 2y1(//'^-)o,)][l + vl “(^/'^)ool-' ^Eo + {I-n)m{2A - EoA^) (27.19) 

If we remember that Eo is negative, we find that (27.19) has a minimum 
with respect to variation of A when A = I/Eq^ in which case (27.17) 
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becomes 

E, + WiR) ^Eo + = Eo- ^ (27.20) 


Thus in (27.16) and (27.20) we have both upper and lower limits on the 
interaction energy 




^ W{R) ^ - 


Qe^al 


More careful variation calculations have shown that the numerical coeffi- 
cient in W(R) is very nearly 6.50.^ 

28. THE WKB APPROXIMATION 

In the development of quantum mechanics, the Bohr-Sommerfeld 
quantization rules of the old quantum theory (Sec. 2) occupy a position 
intermediate between classical and quantum mechanics. It is interesting 
that there is a method for the approximate treatment of the Schrodinger 
wave equation that shows its connection with the quantization rules. It 
is based on an expansion of the wave function in powers of h, which, 
while of a semiconvergent or asymptotic character, is nevertheless also 
useful for the approximate solution of quantum-mechanical problems in 
appropriate cases. This method is called the Wentzel-Kramers-Brillouin 
or WKB approxmation, although the mathematical technique seems to 
have been first introduced by Rayleigh in another connection. ^ It is 
applicable to situations in which the wave equation can be separated 
into one or more total differential equations, each of which involves a 
single independent variable. 

Classical Limit. A solution i/'(r,i) of the Schrodinger wave equation 
(6.16) 

can be written in the form 

= A exp — ^ 


1 See L. Pauling and E. B. Wilson, Jr., “Introduction to Quantum Mechanics,” 
Sec. 47a (McGraw-Hill, New York, 1935). 

2 It is sometimes called the BWK method^ the classical approximation^ or the phase 
integral method. For the original work, see Lord Rayleigh, Proc. Roy. Soc. A86, 207 
(1912); H. Jeffreys, Proc. London Math. Soc. (2), 23 , 428 (1923); G. Wentzel, Zeits. f. 
Physik.j 38 , 518 (1926); H. A. Kramers, Zeits. f. Physik., 39 , 828 (1926); L. Brillouin, 
Comptes Rendus, 183 , 24 (1926). For more recent developments, se(^ E. C. Kemble, 
“The Fundamental Principles of Quantum Mechanics,” Sec. 21 (McGraw-Hill, New 
York, 1937); R. E. Langer, Phys. Rev., 61 , 669 (1937); W. H. Furry, Phys. Rev., 71 , 
360 (1947). The treatment of this section resembles most closely those of Kraimu'S 
and Langer. 
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in which case W satisfies the equation 

f; + i(gradHO= + F-|v>W'-0 (28.1) 

In the classical limit {h 0), Eq. (28.1) is the same as Hamilton’s partial 
differential equation for the principal function W 

^ =0, p = grad W 

Since the momentum of the particle is the gradient of W, the possible 
trajectories are orthogonal to the surfaces of constant W and hence, in 
the classical limit, to the surfaces of constant phase of the wave function 
4^. Thus in this limit the rays associated with \[/ (orthogonal trajectories 
to the surfaces of constant phase) are the possible paths of the classical 
particle. 

lEt 

If 4/ is an energy eigenfunction u{T)e ^ , W can be written 
W{Ty{-) = S{t) - Et 

In this case, we have that 

^.(r) = A exp \ ^ (grad - [JH - 7(r)] - = 0 (28.2) 

The WKB method ol)tains the first two terms (one term beyond the clas- 
sical expression) of an expansion of S in powers of ti, in the one-dimensional 
case. 

Approximate Solutions. The basic equation that we consider is 
written in one of the forms 

+ /<•'•“(•»•)'* = 0, k- > 0 (28.3) 

- /c-0r)(t = 0, K- > 0 (28.4) 

so that k and k are always inal. ddicse are e(iiiival(ait to the one-dimen- 
sional wave e(iuat ion (8.5), if we put 

/K-'O = + j - r(aO]!i when V{:t) < E 

f (28.5) 

K(.t) = + 1 i2M[F(.r) - E]\i when F(.c) > E 

^ E. T. Whittjik(‘r, “Analyticul Pynainu'S,” 3(1 ed., \42 (Csunbridgo, London, 
1927). 
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Equations (28.3) and (28.4) are also equivalent to the radial wave 
equation (19.2) if a: is replaced by r, V (r) is replaced by 

( ) M(l+_1) 

^ ^ ^ 2/ir^ 

and u is equal to r times the radial wave function. 

We restrict our attention for the present to Eq. (28.3) ; we shall be 
able to generalize the resulting expression for u(x) to obtain solutions of 
(28.4). We put 

iS(x) 

u{x) = Ae ^ 

which on substitution into (28.3) gives the one-dimensional form of 
(28.2) 

ihS" - = 0 (28.6) 

where primes denote differentiation with respect to x. 

We substitute an expansion of S in powers of h into (28.6) and equate 
equal powers of h. 

aS = >So hSi 

-aS'o^ + 2m(£ - to = 0 

iS'o' - 2S'oSi = 0, etc. 

Integration of these equations gives 

>So(x) = ±h Hx')dx', Si{x) = a In k{x), 

where arbitrary constants of integration that can be absorbed in the 
coefficient A have been omitted. We thus obtain to this order of 
approximation 

u(x) = Ak-i exp ( + 0* kdx), V < E (28.7) 

In similar fashion, the approximate solution of (28.4) is 

u(x) = BK-i exp (± j"" Kdx), V > E (28.8) 

Asymptotic Nature of the Solutions. The accuracy of these WKB 
solutions can be gauged by comparing the magnitudes of the successive 
terms So and hSi in the series for Since So is a monotonic increasing 

hS 

function of x so long as k does not vanish, the ratio is small if 

aS() 

hS^ 

is small. We thus expect (28.7) to be useful in that part of the 
Oq 

domain of x where 


hS[ 


k' 

SI 


2k^ 


«1 


(28.9) 
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Now the local de Broglie wave length X is 27r//r, so that (28.9) can be 
written 


X’ dk 
At dx 


«7r 


which means that the fractional change in k (or in the wave length) in 
the distance X/4 t is small compared to unity. Thus the WKB solutions 
are useful when the potential energy changes so slowly that the momen- 
tum of the particle is sensibly constant over many wave lengths. 

The same criterion is obtained for (28.8) if we now mean by the '^wave 
length^’ the distance in which the magnitude of u changes by a factor 

It is apparent that the condition (28.9) is violated near the turning 
points of the classical motion, where V{x) = E, k and k are zero, and 
the “wave length'’ is infinite. Thus the solutions (28.7) and (28.8) are 
asymptotically valid in the sense that they can be used several wave 
lengths from the nearest turning point if, as is usually the case, the wave 
length is there slowly varying. 

The asymptotic solutions are of little use to us unless we know how to 
connect an oscillating solution like (28.7) to an exponential solution like 
(28,8) across a turning point. It is only in this way, for example, that 
we can apply boundary conditions and obtain energy eigenvalues. The 
derivation of such connection formulas^ which we consider next, is the 
central problem of the WKB approximation. 

Solution near a Turning Point. The wave equations (28.3) and (28.4) 
are regular at a turning point, so that there is a solution that is analytic 
there and has asymptotic forms like (28.7) and (28.8). Such a solution 
usually cannot be written down in closed form. The wave equation can, 
however, be modified slightly so that an exact solution that has the desired 
asymptotic forms can be obtained. 

We can w ithout loss of generality take the origin of a; at a particular 
turning point; w'c also assume for the moment that V {x) K E io the right 

of the turning point (positive x)^ and put ^(.r) = kdx. Now if 

= C.r”, where C is a positive constant, Eq. (28.3) is known to have 
the solutions 

uia:) = m = (28.10) 


where J is a Bessel function; this can be verified by direct substitution. 
The avSymptoti(; form of J is such (see below) that (28.10) agrees asymp- 
totically with (28.7). 

We therefore try to retain this form by rewriting (28.3) with an 
additional term d(x ) : 



182 


QUANTUM MECHANICS 


[Chap. VII 


g + - e)u = 0 (28.11) 

Substitution of (28.10) into (28.11) shows that the new equation is 
satisfied if we define 6 as 

We expand s^b 2 ^ power series in x: 

k^{x) = Cx^il + ax + hx‘^ + • ^ ) 

in which case B can also be expanded in a series. The l/x"^ and 1/x terms 
vanish, and the leading term is independent of x. 


e(x) — 


3(n + 5)a^ 
2{n + 4)(m + 6) 


(28.13) 



Fig. 26. A typical linear turning point, where Vix) = K at a: = 0; in region 1, E > V{x), 
and in region 2, E < V{x). 


We can now see that (28.11) is a good appi’oximation to the actual 
wave equation (28.3). The similarity in structure between each of the 
three terms in (28.12) and the asymptotic accuracy criterion (28.9) 
indicates that 6 « in the asymptotic region if the WKB method can 
be used at all. At and near the turning point, 6 is not negligible in com- 
parison with k^j since 0 is a constant and A*- vanishes at a: == 0. However, 
(28.13) shows that 0(0) is quite small, being of second order in the devia- 
tion of from the simple form Cx^, Thus for potential functions V (.r) 
that are slowly varying, (28.10) is expected to be a good approximation to 
the actual solution of equation (28.3). 

Linear Turning Point. We now specialize to the situation of greatest 
physical interest, in which n = 1. A typical linear turning point is 
shown in Fig. 25; Eq. (28.3) is used in region 1 ( 2 ; > 0), and Eq. (28.4) in 

region 2 (a: < 0). We put ^ 1 = kdx, ^ 2 = f<dx, so that both 

and ^2 increase as x moves away from the turning point; this makes it 
easy to generalize the results to situations in which the regions 1 and 2 
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are interchanged, 
regions are 


The two independent solutions in each of the two 


U2^{X) = B±{2iK-4/±i(f2) 


(28.14) 


It is evident that we must replace J by I, the Bessel function of imaginary 
argument, in region 2. 

We require the leading terms of the power series expansions and of 
the asymptotic expansions for these functions 








.->0 r(i ± i) 

> cos 

K— > 00 




.->0 r(i + i) 

> (2ir?2)- He*’ + 


(28.15) 


It is important to note that the term in the asymptotic expansion 
for I can be retained only when a combination of solutions is chosen 
such that the coefficient of ef* is zero. This is l)ecause other terms in the 
asymptotic expansion, such as have been neglected, and these are 

of larger order of magnitude than Tlie asymptotic nature of the 

WKB approximation is such that if the term that increases exponentially 
away from t.he turning point is present, it is impossible to say whether or 
not the (leci'easing exponent, ial term is also there. 

Connection at the Turning Point. 'J’ho leading terra in /(•“ at a; = 0 
is Cx, so that /r ^ c.r-l, k ^ c|.r|i, J, ^ (2c/,3).r5, J.. ^ (2c/3)|a:|8, where 
c = '^riien from (2S.1I) iind (28.1.5) we obt.ain the behavior of the 

7/.’s near a: = 0 


w-J- ^ .1-,. 

vt ^ /f-|. 


(li)=(ic) 

l’(-i) 

(.5)Hffc) 

l’(i) 




- - r(f) 


It is apparent t.lien t.hiit, //,)“ joins sinool.hly on t.o if B+ = — A+, and 
that ay joins smoothly on to if li- = 

l''h(^s(( rdations hcitwemi tlui ecxdlicicmts cjin be used to obtain asymp- 
t,ot,ic forms like (28.7) and (28.8) for the iwo iml(qxni(lent solutions u+ 
and vr in the two r(‘gions (t.lui arbitraiy mult,i[)Iying constants A± are 
omitted). 


^ WhiUuktM* nnd Watson, op. r//., (.Unif). 17. 
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The asymptotic forms of any linear combination of and u~~ can be 
found from Eqs. (28.16). 

As 3 miptotic Connection Formulas. Convenient connection formulas 
between the asymptotic WKB solutions in the two regions can be 
obtained by choosing suitable linear combinations of and u-. Thus 
the combination + u~ contains only the decreasing exponential, and 
yields the first connection formula 


—^k~i cos (^1 — ir) 


(28.17) 


The arrow in (28.17) implies that the asymptotic solution in region 2 that 
appears on the left goes into the asymptotic solution in region 1 that 
appears on the right, but that the converse is not necessarily true. This 
is because a small error in the phase of the cosine introduces the dominant 
increasing exponential in region 2A 

Another linear combination of and vr can be found that gives the 
second connection formula 


sin rj 4- cos (fi — ir + rj) (28.18) 


where rj is appreciably different from zero or an integer multiple of tt. 
The arrow in (28.18) appears since the neglected decreasing exponential 
in region 2 alters the phase of the cosine in region 1 by an indeterminate 
amount if the connection is reversed. 

Energy Levels of a Potential Well. We now give a simple example of 
the application of the WKB approximation that serves as a derivation of 
one of the Bohr-Sommerfeld quantization rules. We wish to find the 
energy levels of a particle moving in the one-dimensional potential well 


^ The converse of (28.17) can be used in the following sense: If some parameter in 
the solution (such as the energy E) is varied continuously so that the phase of the 
cosine in region 1 passes through the value — Jtt, the increasing exponential in region 2 
disappears for some indeterminate value of the phase close to — Jtt and leaves only the 
decreasing exponential. This result is useful, for example, in treating the resonance 
scattering of alpha particles by a heavy nucleus. 
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shown in Fig. 26. For any assumed energy level £/, there are supposed 
to be just two turning points of the classical motion such that 

V(xi) = V(X 2 ) = E 

The regions x < xi and x > x^ are Type 2 regions in which we know that 
u decreases away from the turning points in order to satisfy the boundary 
conditions at + . Thus we have only the decreasing exponential WKB 

solution in these regions. 

The connection formula (28.17) can be applied at the turning point 
which separates a Type 2 region from the Type 1 region xi < x < x^. 



Fig. 26. Application of tho WKB method to a potential trouRli; linear turning points 
occur at x\ and X'i. 

The only change is that the lower limit on the integral is changed from 
0 to Xij so that the solution to the right of the turning point is 

cos kdx — (28.19) 

apart from an arbitrary multiplying constant. The same connection 
formula can also be applied at xo by reversing the direction of the x axis 
and changing the fixed limit on the $ integrals from 0 to X 2 ] the arrow in 
(28.17) still means that we go from a region 2 solution to a region 1 solu- 
tion, but now the latter is to the left of the turning point and the former 

is to the right. We redefine = f: kdx, ^2 = Kdx so that they still 

increase going away from tho turning point, in which case (28.17) can be 
used without any modification. The solution to the left of this turning 

point is then /fi cos “ kdx — which can be written 

cos kdx — i'TT — 17^, rj = kdx — (28.20) 
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As was the case in the qualitative discussion of discrete energy eigen- 
values in Sec. 8, we obtain the energy levels of this system by requiring 
that the two solutions (28,19) and (28.20) join together smoothly in the 
interior of region 1. This evidently requires that rj be zero or a positive 

integer multiple of tt, since kdx is necessarily positive. We can write 

Jxi 

the determining equation for the eigenvalues as 

T' kdx = (n + i)7r, n = 0, 1, 2, . . . (28.21) 

Jxx 

Equation (28.21) is to be used for values of n up to the point at which E 
becomes so large that one or both of the turning points disappears. 

A Quantization Rule. The expression (28.5) for k can be substituted 
into (28.21) to give one of the Bohr-Somerfeld quantization rules of the 
old quantum theory 

2 r" {2i^[E - V{x)]Wx = (n + i)h (28.22) 

Jxi 

The left side of (28.22) is the integral around a complete cycle of the 
motion (from xi to X 2 and back to xi) of the momentum [2/x(£' — F)]^. 
The right side is the quantum value of the phase integral, with half- 
integer rather than integer quantum numbers. 

It is easily seen from the form of the solution (28.20) that n is the 
number of nodes of the WKB wave function between the turning points. 
Since it is basic to the WKB method that we can develop asymptotic 
solutions like (28.7) only several wave lengths from each turning point, 
the approximation should be good only if the turning points are several 
wave lengths apart, or if n is large in comparison with unity. This con- 
firms the earlier view that the WKB method is a semiclassieal approxima- 
tion, since it is expected to be most useful in the nearly classical limit of 
large quantum numbers. 

Actually, the WKB approximation also gives quite good results for 
the low quantum states of many systems. For example, if we apply 
(28.22) to the harmonic oscillator F(:r) = it is known from the old 

quantum theory that the correct energy levels are obtained for all quan- 
tum numbers. 

Special Boundary Conditions. The boundary condition to be applied 
to a WKB solution at a perfectly rigid wall (F changes discontinuously 
to + 00 at a; = xq) is that the wave function vanishes there. Thus if k 
(for a region of Type 1) is slowly varying up to Xo and other turning points 
are remote, the asymptotic solution can be used and has the form 

A*“^sin kdo^- 
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Similarly, for a finite potential step that is far from other turning 
points, the asymptotic WKB solutions can be used up to the point of 
discontinuity of F if or k is slowly varying. Then the magnitudes and 
slopes of the solutions on the two sides can be matched at this point. 

As pointed out after Eq. (28.6), the WKB method can be applied 
to the radial wave equation for a spherically symmetric potential. When 
I — 0, the radial wave function must be finite at r = 0, and so u must 
vanish there. If k or k is slowly varying there, the asymptotic solutions 
can be used; for example, if E — V{r) is positive, finite, and slowly vary- 
ing at and near r = 0, the solution is k’'^ sin kdr^. When the 

effective potential energy is infinite at r = 0, either because V itself is 
infinite or because of the centrifugal-force contribution for I 7 ^ 0, the 
situation is more complicated and requires further investigation. 

Problems 

1. A harmonic oscillator of charge e is perturbed by an electric field of strength E 
in the positive x direction. Calculate the change in each energy level to second order 
in the perturbation. Show that this problem can be solved exactly, and compare the 
result with the perturbation api)roximation. 

2. A harmonic oscillator is perturbed by an extra potential energy Calculate 
th(^ change in each energy level to second order in the perturbation. 

3. Find the first-order Stark otToct for a. hydrogen atom in the state n — 3. 

4. A system that has three unpertuii>cd states can be represented by the per- 
turb(^d Hamiltonian matrix 

(E, 0 a. \ 

(0 Ex h ] 

\a h A', / 

where E-i > E\. llsc the second-order noiuh^geiicrate perturbation theory to find the 
l)erturbed eigenvalues. Tlum diagonalize the matrix to find the exact eigenvalues. 
Finally, us(i tin; sec,ond-ordcr degenerate ])(u*turbation tlu’iory. (/ompare the three 
results obtained. 

6. Show that the total H(;a-tt(;ring (toss s(u*,tion by a potential that falls olT at great 
distance's like is finite if tl > 2, (a) by in(\*uis of tlu^ Born ap])roximation formula 
(2().20), and (h) by means of the Born approximation expression for the phase shifts 
(26.27) (see footnote 1, page 78). 

_r 

6. r'ind the dilTerential scattering <*.ross scMd.ion for a potential V^(r) = — F 06 
using th(‘. Born a])proximation. Wlnit is tlni validity crittTion in this case, and under 
what (‘/irc.umst;anc.e.s is it siitislied? 

7. In a, particular scattering problem in which the potential is sph(n*ically sym- 
metric, the phase shift 60 is large and c.a,n be. computed exac.tly, but all the other phase, 
shifts are small. Den-ive an expre^ssion for tlu^ <lilTm*ent,ia.l s(*att(n*ing cross section with 
the help of th(^ Born approxiimition, in which all the phase shifts are taken into 
account.. 

8. Use: the Born tipproximation to dis(mss (qualitatively the se:attering by a crystal 
lattice of identical atoms. 
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9 . A trial function ^ differs from an eigenfunction ue by a small amount, so that 
— UE €^i, where ue and 4/1 are normalized and e 1. Show that {H) differs 
from E only by terms of order 

10 . If the first n — 1 eigenfunctions of a particular Hamiltonian are known, write 
down a formal expression for a variation-method trial function that could be used to 
get an upper limit on the nth energy level. 

11 . Find the next terms (of order R~^) in the expansion of Eq. (27.12). Show 
that their diagonal matrix element for the unperturbed ground state vanishes, so that 
there is no inverse fourth power contribution to the van der Waals interaction. 

12 . Use the combination of perturbation and variation methods employed in Sec. 
27 in connection with the van der Waals interaction to obtain limits on the electric 
susceptibility of a hydrogen atom in its ground state. The electric susceptibility is 
the ratio of the induced electric-dipole moment to the applied electric field, or is the 
negative of the second derivative of the perturbed energy with respect to the electric 
field at zero applied field. 

13 . A particle of mass m is bound by the potential of Prob. 6, where fh^jmV oa^ = J. 
Use the variation method with a trial function to get a good limit on the lowest 
energy eigenvalue. 

14 . Show that the WKB approximation gives the correct energy eigenvalues for all 
states of the harmonic oscillator. 

16 . Apply the WKB method to the one-dimensional motion of a particle of mass m 
in a potential that equals — F 0 at x =0, changes linearly with x until it vanishes at 
a; — ± a, and is zero for |:c| > a. Find all the bound energy levels obtained in this 
approximation if mV = 40. 

16 . Discuss the connection between the WKB approximation and the penetration 
through “opaque” potential barriers; the barriers are to be like that considered in 
Sec. 17, although not necessarily square. 



CHAPTER VIII 


APPROXIMATION METHODS FOR TIME-DEPENDENT 

PROBLEMS 

It is generally impossible to obtain exact solutions of the Schrodinger 
equation when the Hamiltonian depends on the time. The three approxi- 
mation methods that we consider in this chapter all start from the 
assumption that there is a time-independent Hamiltonian that approxi- 
mates the actual Hamiltonian in some sense, for which the Schrodinger 
equation can be solved. The time-dependent part of the actual Hamil- 
tonian may be small compared to the stationary part, in which case a 
perturbation method can be used. Or there may be time-dependent 
parameters in the actual Hamiltonian that change very slowly (adiabatic 
approximation) or very rapidly (sudden approximation) in comparison 
with the periods of the approximate stationary solutions. 

29. TIME-DEPENDENT PERTURBATION THEORY 
The perturbation theory of a system for which the Hamiltonian 
depends on the time^ is sometimes called the method of variation of con- 
stants. It starts from the assumption of Sec. 26 that 

H = Ho + H', HoUn = EnUn (29.1) 

where the unperturbed Hamiltonian II q can be solved for its normalized 
eigenfunctions Un and its energy eigenvalues and the perturbation H' 
is small. Since H' now dopends on the time, stationary solutions of the 
actual Schrodinger cc][uation do not exist, and we must work with the 
time-dependent equation 

ih (29.2) 

Expansion in Unperturbed Eigenfunctions, Our procedure is to 

iKnt 

express as an expansion in the eigenf unci ions of the unperturbed 

time-dependent wave eciuation, where the expansion coefficients evi- 
dently depend on the time. 

iEnf 

^ = S(^n(f)Une 

1 P. A. M. Dirac, Proc. Roy. Soc., A112, 061 (1926) ; AlU, 243 (1927). 

180 


(29.3) 
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S denotes a summation over the discrete set together with an integration 
over the continuous set of eigenfunctions. Substitution of (29.3) into 
(29.2) gives 

iEnt iBnt iEnt 

where the dot denotes differentiation with respect to the time. 

We replace HoUn by EnUn on the right side, multiply through on the 
left by Hk^ and integrate over all space, making use of the orthonormality 
of the 

iEkt iEnt ^ 

ihdke * = ^ j UkH'Undr 

The integral on the right is the matrix element of the perturbation. 
We define the Bohr (angular) frequency 

Ct^kn — ^ 

and obtain 

The group of Eqs. (29.5) for all A*’s is exactly equivalent to the Schro- 
dinger equation (29.2); the amplitude an of a particular unperturbed 
eigenfunction Un in the expansion of i/ has replaced the amplitude V' at a 
particular point in space. Because of the choice of the representation, 
which is determined by the eigenfunctions of the unperturbed Hamilton- 
ian, Ho does not appear explicitly in (29.5). 

The perturbation approximation consists in replacing H' by \H' in 
(29.1) and (29.5), and expressing the a’s as power series in X: 

+ • • • (29.6) 

As in Sec. 25, we assume that these series are analytic for X between 0 
and 1. We can therefore substitute (29.6) into (29.5), equate coefficients 
of equal powers of X, and set X = 1 in the final results. The substitution 
yields the set of ecpiations 

4"> = 0; s = 0, 1, 2, . . . (29.7) 

These can in principle be integrated successive!}^ to obtain approximate 
solutions to any desired order in the perturbation. 

First-order Perturbation. The first of Eqs. (29.7) shows that the 
zero-order coefficients are constant in time. Their values are the 
initial conditions of the problem, which specify the state of the system, 
before the perturbation is applied. We assume throughout this section 


(29.4) 

(29.5) 
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that all except one of the are zero, so that the system is in a definite 
unperturbed energy state when the perturbation is applied.^ The results 
that we shall obtain can easily be generalized to situations in which more 
than one of the zero-order coefficients is different from zero. 

We thus put or 5 (A* — m), according as the state m is one 

of a discrete or a continuous set. Integration of the first-order equation 
gives 

a^it) = ^ (29.8) 

where the constant of integration is taken to be zero in order that 
be zero at ^ = — oo (before the perturbation is applied). If H' is of 
finite duration, the amplitude of a state Uk (A* m) after the perturbation 
has disappeared is proportional to the time Fourier component of the 
matrix element of the perturbation between this state and the initial state, 
that corresponds to the angular frequency o)km given in (29.4). This 
result is analogous to that obtained for the scattered amplitude in the 
Born approximation [see the discussion of Eq. (26.18)]. 

Equation (29.8) takes a particularly simple form if the perturbation 
H' is independent of the time except for being turned on at one time and 
off at a later time. We call these two times 0 and respectively, and 
obtain for the first-order amplitudes at the time t (these are also the 
amplitudes at any subsequent time) 

TP. — 1 

ant) = - - f* — ^ (29.9) 

Thus the probability of finding th(^ system in the state A* at t is 

The factor sin- ^ookml/o)l,„ is plotted in Fig. 27 as a function of co/cm. 

Physical Interpretation. The height of the main peak in Fig. 27 
increases in proportion to //“, while its breadth d(Mn'eases inversely as t, so 
that the area under the curve is propo!*tional to t. Thus if there is a 
group of states k that have energies mvu’ly ecpial to that of the initial 
state m, and for whi(*,h ///,,„ is roughly independent of A*, the probability of 
finding the syst(nn in oru^ or a^notJuu* of thes(^ st-ates is pro])ortional to t. 
This is the physically int.(M-(^sting situation, siina) wliat wo wish to calculate 
eventually is a IraimMon probability per un.it time, w, and this implies that 

^ This iKH'd not conflict: with tlu^ unc(*rt:iinty n'ljit/ion ( 3 . 3 ), Hinc.c tlu^ infiniU; lapse 
of time prior to the jipplicjitioii of p(*rtiirhM.t,ion in;ik(^s it poHsible to detertniue the 
()ri{2:inal energy of tlu^ .syst^eni with arhitrarily gr(‘a.t ])recision. 
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the probability that a transition has taken place when the perturbation 
has been on for a time t is proportional to tA 

It follows that a definite value of w exists only when the final state k 
is one of a continuous or nearly continuous set of states. The spread in 
energy of the final states to which transitions occur, shown in Fig. 27 



Fig. 27. The ordinate iw proportional to the first-order perturl)ation probability of finding 
a system in a state that has an energy different from that of the initial state by hujkm.', the 
scales for ordinate and abscissa depend on the duration t of tlie perturbation in the manner 
indicated. 

(Ek = E,n + hoikm), is connected with the uncertainty relation (3.3) for 
energy and time in the following way. We can regard the perturbation 
/f' as a device that measures the energy of the system (which is not 
necessarily its initial energy since the system is disturbed) by transferring 
it to one of the states k. The time available for the measurement is 
so that the uncertainty in energy predicted by (3.3) is of order h/t^ in 
agreement with the breadth of the main peak in Fig. 27. It is interesting 

^ We assume that the total transition probability to all states k is small enough in 
comparison with unity so that the initial state vi is not significantly depleted. This is 
equivalent to the original assumption that the perturbation is small, which means 
that for times t of physical interest, there i little change in the initial state. There 
can still be an effect of observable magnitude if a large number of independent system^ 
receive identical treatment. 
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to note that conservation of energy, suitably modified by the uncertainty 
principle, is an automatic consequence of the calculation and does not 
have to be inserted as a separate assumption. 

Transition Probability. In order to obtain an explicit expression 
for w, it is convenient to assume that the system is contained in a large 
cubical box of dimensions L that has periodic boundary conditions at its 
walls (Sec. 10). Then the eigenfunctions xin form a discrete set and can 
be normalized to \inity in the volume IJ. We now consider a particular 
group of final states h that have nearly the same energy as the initial 
state m and for which the matrix element of the perturbation is a 
slowly varying function of k. AVe define a density of final states p{k) 
such that p{k)dEk is the number of such states in the energy range 
and assume that p(/r) is also a slowly varying function of A*. 

The transition probability per unit time to one or another of this group 
of states can then be written 


«) = ^ |ai?>(/)|^ = r* f (29.10) 

Ic 


when the box L is large enough so that the summation over k can be 
replaced by the integration over Ek, Since and p{k) are slowly vary- 
ing and most of the contribution to the integral comes from a narrow 
range of energy about Ek == they can be taken outside of the integral, 
and (29.10) can be rewritten as 




1 4 


1//L 

h 


P(k) 


/: 


sin^ icokmt 


^kin 


do)k 


(29.11) 


where the index k now refers to a typicial one of the group of states having 
about the energy Eu,. Tlu^ integral in (29.1 1) is J sin**^ xdx = -g-Tri, 

so that we finally obtain 

w (29.12) 


which is independent of I, as expcud.ed. 

There may b(^ several dilTerent groups of final states k, all of which 
have about the energy E„, but for which the perturbation matrix ele- 
ments and the densities of state's p{k), while nearly constant within a 
group, ditler from one group to anotluM*. Them (29.12) gives the transi- 
tions per unit time to a particudar group; similar expressions of the same 
form give the rates of transition to other gi’oups. 

Scattering Cross Section. As a first application of Kq. (29.12), we 
calculate w when th(^ initial and final stah^s are frec-particle momentum 
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eigenfunctions (plane waves) and the perturbation is a potential energy 
y(r). The result can be interpreted in terms of an elastic scattering cross 
section, and then agrees with the Born approximation result (Sec. 26), as 
expected. We take for the initial and final states 

t^^(r) = exp iko • r, Uh{t) = L~^ exp ik • r 

where ko and k are the initial and final propagation vectors, respectively. 
Thus the perturbation matrix element is 

exp (“zk • r)F(r) exp (iko • x)dr 

= L-3j7(r) exp (^K • r)dr (29.13) 

where K = ko — k. 

The density of final states can be found from the permitted values of fc 
in a box: A-* = 27rna./Z/, etc., where the n^s are positive or negative integers 
or zero. Thus there are {L/2Trydks!dkydkz states in the range dkxdkydkz of 
propagation vector. Now there are many different final states k with the 
same energy, corresponding to different directions of k with a given 
magnitude. The matrix element (29.13) usually depends on the direction 
of k, so that we have to consider only a small range of directions at a time. 
We therefore ask for the rate of transition into an infinitesimal element of 
solid angle sin dddd<i> about some direction that is specified by the polar 
angles Then p(k)dEk is equal to the number of states in the range 

dTk given by the above solid angle element and the magnitude element dk 
that corresponds to the energy element dEk. 

kHk sin ddddcj) 

Since Ek = h"^k'^l2pj dEk/dk = h“k/nj and we obtain for p{k) 

p{k) = 3 ^ - ., k sin 9d6d<i> (29.14) 

The value of w obtained in this way is the number of particles scat- 
tered into the element of solid angle per unit time when there is one 
incident particle in the volume L^. This is an incident flux of v/L^ 
particles per unit area and time, where v = hk/p is the speed of the inci- 
dent or scattered particle (since energy is conserved). Since the dif- 
ferential scattering cross section is defined as the scattering per unit 
incident flux, we have that 

ulL ^ 

cr{6j<j}) sin 6d6d(l> = ^ v) 
nk 


o{k)dE, = 




(29.15) 



Sec. 29] METHODS FOR TIME-DEPENDENT PROBLEMS 
Substitution of (29.12), (29.13), and (29.14) into (29.15) gives 


= (2&)' 1/ 


195 


(29.16) 


This agrees with the Born approximation result (26.18) and (26.19), 
and has the same range of validity. 

Harmonic Perturbation. Another situation for which Eq. (29.8) 
assumes a simple form occurs Avhen the perturbation depends harmoni- 
cally on the time, except for being turned on at zero time and off at time t. 
If we put HknXt') = sin o)t'j the first-order amplitudes at time t are 


a^Kt) = - 


2ih L ^km + 0) 


CO/crn ^ 


(29.17) 


The probability of finding the system in the state k is appreciable only 
when the denominator of one or the other of the two terms in (29.17) is 
practically zero. Thus there is no interference between the two terms, 
and the perturbation can produce transitions for which ^ ±a? if the 
corresponding matrix element does not vanish. The energy-conserva- 
tion condition Ejc ^ Em obtained earlier is now replaced by the condition 


Ek ^ Em ± hoi 


(29.18) 


Equation (29.18) suggests that the first-order effect of a perturbation 
that varies harmonically in the time with angular frequency w is to trans- 
fer to or re(*civo fi'om the system on whi(!h it acts an amount of energy 
hoi. This coiKu^pt will be used for a qiuilitative treatment of radiation 
processes in ( -hap. X. 

Second-order Perturbation. The scries of cciuations (20.7) can read- 
ily be solved to second or'dcr for a i)crtuti)ation that is constant in time. 
We take the equation with s = 1 , and substitute from (29.9) on the right 
side. 




O^nni 




Integrjition of this ociiiation suhjoc.t to the initial condition a'/?'(0) = 0 
gives for the second-order iuni)lil.ii(les at time t 


_ If r,.ii 

„,),?)(/) = h-^S - 

0)nni 


ki>,t — 


— I 
Oikti 


(29.19) 


Equation (29.19) indicates that transitions for whic.h the probability 
increas(is liiKiarly wit h the t ime can occur eitiher ior oj;,.,,, ^ 0 or ajA:n = 0- 
While the first type of triUisition conserves energy between the initial 
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state m and the final state the second need not. It is not difficult to 
see that the second bracket term arises from the 1 in the numerator of 
(29.9), which in turn comes from the initial condition at zero time. This 
initial condition means that the perturbation is turned on suddenly; thus 
the mathematical formulation suggests that the second-order transitions 
that do not conserve energy are caused by the sudden turning on of the 
perturbation. This is in agreement with Eqs. (29.8) and (29.17), which 
show that a perturbation that has nonzero frequency Fourier components 
can give up energy to or absorb energy from the system that it perturbs. 
In the case we are now considering, these Fourier components are not 
marked enough to produce in first order a transition probability that is 
proportional to the time, but they do in second order. 

In most practical problems, the sudden turning on of the perturbation 
is introduced as a mathematical artifice that simplifies the calculation. 
Actually, in such cases, the perturbation either is always present, or is 
turned on very slowly, and we are concerned with transitions that con- 
serve energy between initial and final states. Problems that can be 
treated by the sudden approximation (see end of Sec. 31) are an excep- 
tion; there energy need not be conserved. Throughout this section and 
the next, we assume that only transitions that conserve the energy actu- 
ally occur (cOftm = 0), 

Suppose now that the perturbation produces no transitions in fii*st 
order; this means that there are no states n that conserve energy (co„„» ^ 0) 
for which the matrix element 0. Since = 0, this means also 

that = 0 whenever = 0. In this case, the second term in the 
bracket of (29.19) is never appreciable. The calculation of w is carried 
through as before, except that replaces thus (29.12) can be used 
if the matrix element is replaced by the second-order matrix element 

S eT-'k 

Effect of First-order Transitions. In the event that transitions can 
occur in first order, but they are not to the state in which we are inter- 
ested, we can proceed as follows. It is still true that the second term in 
the bracket of (29.19) is negligible for states n that have energies appreci- 
ably different from Ek (or jE^), since then wkn is large. However, there 
may now be states n for which En, E^j and Ek are all close toget her and 
neither nor is zero. The second bracket term cannot be ignored, 
for without it the summation or integration over n would have a singu- 
larity when ojnm is zero. It is not difficult to see that for any value of 
o)hm (zero or otherwise), the entire bracket is proportional to conm (which 
is equal to cjkm — w&n) when wnm is small; this cancels out the conw 
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in the denominator, and makes the summand or integrand finite 
where conm = 0.^ 

We now show how an explicit evaluation of (29,19) is obtained in this 
case if S can be represented by an integral over or 6j„m. We divide the 
integral into parts according as |w„w| is large or is not large in comparison 
with l/t In the first region, the second bracket term in (29.19) can 
be neglected, since \(jJhn\ = Icojtm — o)nm\ is also large in comparison with 
l/t (o3km = 0 means that (akmt is not large in comparison with unity). 
We thus obtain for this part of the integral 

~ ^ f p(n)Mu>„„ (29.21) 

Oikm J Wnm 

Here p{n)dEn is the number of states of the particular group n under 
consideration in the energy range dEn] the prime on the integral implies 
that the region —c/t g conm ^ c/t is excluded from the integration, where 
c is a constant number that is large in comparison with unity. If there 
are two or more distinct groups of states n for which the matrix elements 
or densities of states differ, a further summation over these different 
groups must eventually be carried out. 

In the second region, where ^ c//., we assume that t is large 
enough so that can be regarded as a constant, taken outside 

of the integral, and evaluated at Wnm = 0. Wo must now use both terms 
in the bracket of (29,19) in order that the integrand bo finite. This part 
of the integral is then 


— 1 

— 

c \_ 


\ (29.22) 


The integral that appears in (29.22) can be evaluated by considering the 
contour in the complex co„,« piano shown in Fig. 28, which contains no 
poles of the integrand. Thus the integral over the closed contour is 
zero, and the integral in (29.22) is ecjual to the integral around the semi- 
circle of radius c/t taken in the counterclockwise direction. The magni- 
tude of Wnm is great enough over this semicircle so that the contribution 


1 This rcBalt follows quite gGiuu’ally from the strue^turo of tlie whole perturbation 
calculation, since there is no way in which a singularity ean appear. Thus if the 
perturbation is turned on slowly rath(\r than suddenly, so that the esnergy-eonsorvation 
difficulties mentioned above do not occur, the second bracket term of (29.19) has a 
more complicated form but still cancels out the singularity at oinm ™ 0. This can bo 
verified by direct calculation. 
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of the second term in the integrand can be neglected in comparison with 
the first. The integral in (29.22) is then easily evaluated and becomes 

Tri (29.23) 

For large t, the prime on the integral in (29.21) is equivalent to taking 
its principal value. ‘ Thus if we substitute (29.23) into (29.22) and add 
the result to (29.21), we obtain an expression like (29.21) except that the 
primed integral is replaced by the principal value of the integral plus 
xi times the residue of the integrand at the pole = 0. This is equiva- 
lent to evaluating the integral along a contour in the complex unm plane 



Ftg. 28. Contour for the evaluation of the integral in Eq. (29.22). 

that passes along the real axis from — ooto oo except for passing beneath 
the origin. We thus obtain finally 

X ‘28.24) 

where the contour C is over the real axis of except for passing under 
the pole of the integrand at Equation (29.24) is to be used in 

place of (29.19) whenever S can be represented by fp(n)dE„, Compari- 
son of Eqs. (29.24) and (29.9) shows that we can use the expression (29. 12) 
for w if we replace the matrix element //*,„ by the integral in (29.24), 
which we sometimes call the second-order matrix element. An example 
of this is given in the next section. 

Intermediate States. We sec that the time-dependent perturbation 
theory gives a result in first order if there is a nonvanishing matrix 

‘ E. T. Whittaker and G. N. Watson, “A Course of Modern Analysis,” 4th cd., 
pp. 76, 117 (Cambridge, London, 1935). 
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element of W that connects the initial state m and the final state A*. 
If = 0, but there are one or more states n for which neither nor 
is zero, the transition occui's in second order. 

It is then convenient to think of one of the states n as an intermediate 
Btate: the perturbation transfers the system from m to fr in two steps, 
through a state n. Energy need not be conserved for an intermediate 
state, since it has only a transient existence and according to the uncer- 
tainty relation (3.3) it is impossible to determine the energy of such a 
short-lived state with any precision. If some of the intermediate states 
do conserve energy, the summation (29.20) over these states must be 
interpreted in accordance with the integral in (29,24). 

In some cases, a perturbation can produce a particular transition only 
through two or more different intermediate states; this corresponds to a 
third or higher order of the perturbation calculation. If the perturba- 
tion is small, it usually happens that the result of a calculation to the 
lowest order in which the transition occurs gives a useful result, while 
higher order calculations do not improve on this and may even be quite 
misleading. 


30. INELASTIC COLLISIONS 

The expression for the scattering cross section given in the preceding 
section is easily generalized to a description of inelastic collisions, in which 
internal as well as kinetic energy can be transferred between the colliding 
systems. In this section we apply the result to two problems that are 
typical of first-order and of second-order processes.^ The latter calcula- 
tion is of unusual theoretical interest, for it shows explicitly how a particle 
that is described enlh*ely in terms of a plane wave (momentum eigen- 
function) can produce a sharp track in a Wilson cloud chamber. 

Expression for the Cross Section. The expression (29.12) for the 
rate of transition w is applicable to inelastic collisions if the matrix cle- 
ment is defined accordingly. We consider here the collision of a fast 
electron with a hydrogen atom in its ground state, and wish to calculate 
the cross section for scattering of the electron through a definite angle 
accompanied by oxc.itatioii of the hydrogen atom to a definite state. We 
leave oui- of (ionsid(‘ration the possibility that l»he incident electron 
changes places with the atomic electron; such exchange collisions will be 
taken up in Chap. IX. 


^ The oxamploH nonsidcirncl in this section can also bo treated by an extension of 
the Born approximation; such an extension to first-order rearrangement collisions is 
given in Hoc. 34. For the tnmtinimt of second-order processes, it is more convenient 
to work with the method of variation of c.onstants. 
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The unperturbed Hamiltonian ia the sum of the kinetic energy of the 
incident electron and the Hamiltonian for the hydrogen atom : 


H, 


— — V? — — VS — — 
2m ^ 2m ^ 


(30.1) 


where ri and la are the coordinates of the incident and atomic electrons, 
respectively, referred to the atomic nucleus, which is massive enough so 
that its motion can be neglected. The peiiiurbation is the electrostatic 
energy of interaction between the incident electron and the electron and 
nucleus of the atom 

JI' = — - ~ (30.2) 

ri2 Ti 

The unperturbed wave functions are eigenfunctions of (30.1), which 
we choose to be 

I/“ 8 exp (tko ■ ri)wioo(r 2 ) initial state 

I/“ * exp (^k • ri)w 2 oo(r 2 ) final state ^ ^ 


In spectroscopic notation, this is a 15 — > 25 transition of the atom. The 
magnitude of the propagation vector of the electron after the collision is 
fixed by conservation of energy 




Sao 


(30.4) 


Equations (30.2), (30,3), and (30.4) specify the matrix element that 
appears in (29.12). 

j j exp (^K • ri)u 2 oo(r 2 ) - ^^Wioo(r 2 )dridr 2 , 

K = k„ - k (30.5) 

The differential cross section can be obtained from w as in the preceding 
section. It must be remembered, however, that k appears in the expres- 
sion (29.14) for the density of final states, whereas the initial speed 
Vq = hko/m appears in the expression for the incident particle flux. Thus 
the cross section is 

where B is the angle between the vectors k and ko. 

Evaluation of the Matrix Element. It is apparent from the structure 
of the matrix element (30.5) that the term e^/vi in the integrand contrib- 
utes nothing because of the orthogonality of Wioo and W 200 . This is to be 
expected physically, since interaction between the incident electron and 
the nucleus cannot produce excitation of the atomic electron. 
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The integration of the remaining term over ix can be carried out by 
transforming the volume element from dTidr 2 to drpdr^^ where 9 = ri — r2; 
the Jacobian of the transformation is easily seen to be unity. We can 
then write 


J Ti2 


where we have taken the polar axis of g along the vector K and put w 
for the cosine of the angle between 9 and K, The last integral is not 
strictly convergent, but can be evaluated by inserting an integrating 
factor into the integrand and subsequently taking the limit a — > 0. 
The justification for this is that the integration over ra in (30.5), if per- 
formed first, gives a result that falls off like 1/rf and hence like 1/p^ for 
large so that for large p, the integrand here behaves like sin Kp/p and 
the integral converges. We thus obtain 

f C L P r J exp (,:k . ro) lim [ sin Kpe-<‘>‘(}p 

J ri 2 /v ^ Jo 

= ^ exp {iK • rs) = ^2 (30.7) 


= exp (iK • ra) j 

= 2 t exp (iK ' J J e*^'‘"pdpdw 

/ sin Kpdp 

Jo 


= — exp (iK • I 2 ) 


Substitution of hkp (30.7) and the expressions following Mq. (1().24) for 
the hydrogen wave functions into (30.5), gives an integral over r2 that 
can be evaluated as 


If = J -3 10 \/2iraJr- 


ao 


J! 

me^ 


Differential and Total Cross Sections. The differential cross sec.tion 
for this collision thus becomes 


where 


c{e) 


A- 128a?, 

hWal + 1 )<> 


(30.8) 


= kl + A-’* - 2hk coH e 

= (2A-0 sin i^)'^ — (A’o — A-)(A'o + A" — 2/fo cos d) 


^ The loiidiuK U^rni in l/ri-.! when ri » /'a in 1/ri, and the integral of this over To 
vaniahos bucaufjc of the orthogonality of Wiou and -Uaco. 
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Now the perturbation calculation is best when A’oOo ;s> hi which case i 
is close to A-o and we can rewrite (30.4). 


n i\n _i_ 7 \ 2m 3e* _ 3 
(A-o A-)(A-» + A-) 


Jco + k ~ 2koy 


fro — A* = 


8froa§ 


(30.9) 


With the help of (30.9), the expression for /C- in this high-energy 
limit becomes 

if 2 ^ ^ 4^2 _ iQ ^ ('2fro sin id)^ 

Then according to (30.8), most of the scattering occurs for Ka^ ^ 1, 
which is equivalent to 1/froao. Beyond this, cr(6) falls off with 
increasing angle approximately like cosec^^ .i This ig ^ much more 
rapid decrease with angle than the cosec^ ^6 dependence (26.32) obtained 
for elastic scattering by an atom, and is characteristic of inelastic 
processes. 

The total cross section is found by making use of the exact expression 
for to replace the element of solid angle 27rsin Odd by 27r7idJC/frt)fr, 
with limits fro — fr and fro + fr. Then the integral of (30.8) can be 
obtained explicitly. However, the discussion of the last paragraph 
shows that at high energies most of the contribution to the integral comes 
from near its lower limit, and in accordance with (30.9) 


(fro — kyal ^ 


9 



We thus obtain the leading term in the total cross section at high energies 
by taking the limits 0 and oo for K : 


<x ~ 


1287r /2Y° 

5fr2 Vs/ 


(30.10) 


Cross sections for elastic and other inelastic collisions with hydrogen 
may be obtained by replacing 2^200 in the matrix element (30.5) by the 
appropriate final-state wave function, and modifying (30.4) accordingly. 
The total elastic cross section at high energies turns out to ho 77r/3fr5, 
which is about five times as large as (30.10). Excitation i,o tlio states 
that have n = 2, i = 1 (1/S — ► 2P transitions) is most easily (calculated by 
choosing the three final states (m = 0, ±1) with tlucir polar axis along 
the momentum transfer vector K. Then the factors that appear in 
the wave functions for m = ± 1 make these matrix elements vanish and 
only the state (210) is excited. This corresponds physically to the 
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inability of the incident electron, whose momentuna loss is along K, to 
exert a torque on the atomic electron about this axis. The high-energy 
cross section for this process turns out to be 

O' ^ In (4^000) (30.11) 

The appearance of the logarithmic factor in Eq. (30.11) derives from an 
extra factor l//f- in the differential cross section. Thus in comparison 
with the IS — > 2S scattering, the IS 2P differential scattering is more 
pronounced at small angles and the total scattering decreases less rapidly 
with increasing energy at high energies. 

Production of a Cloud-chamber Track, It seems surprising at first 
that a fast electron, which we can assume possesses a definite momentum 
(magnitude and direction) and hence cannot be localized in space, can 
produce a sharp track in a cloud chamber. This phenomenon may be 
considered from various points of view. In accordance with Ehrenfest’s 
theorem (Sec. 7), we can represent the electron by a wave packet whose 
center of gravity moves like a classical particle. If the wave length is 
short enough, the packet can be quite small without spreading rapidly, 
and will then interact only with atoms that lie close to the path of its 
center. This implies that the elec^tron is represented by a superposition 
of plane waves and hence has an uncertainty in its momentum that 
enables its posii.ion to be sufficiently well defined. 

Anollier appi'oach consists in describing iho electron by a single plane 
Avave, and regarding its intorac,tion with the first atom that it excites or 
ionizes as a position measurement that carries with it an uncertainty of 
the order of l,ho atomic, size. Thereafter, tlie electron is represented by a 
packet, like that descM’ibed in tlie last paragrapli, which is well localized 
if the first atom is large in compai’isoii witli the wave length. 

We consider here in del-iiil a description in Avhich the electron and the 
atoms of tlie cloud-chamber gas are ij*eatofI as ]’)arts of a single system, so 
that we do not have to regard an atomic interact, ion as a position deter- 
mination that chjuiges t,he structure of the electron’s wave function.^ 
To simplify mattcM’s, wo assume that there are just two atoms present in 
their ground states, and Unit their nuclei arc far fi’om each other and are 
fixed in spac^e. We then (^ahuilat-e the cross section for a proc^ess in which 
both atoms are excJted and the electron is scattered inelastically. For a 
fast incident elecd.ron, the perturbatiem theory can bo used, and the 
proc,ess is of sec.ond order. The calculation is interesting both because 

^ Soo jiIho W. llciHcnbcrg, “Tlui Pliysipal Ib’inciph^H of tho Quantum Theory,” 
p. 06 (Univ(T.siiy of ('liiciigo ProHH, ( 1030). 
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of the answer obtained and because it provides an instructive application 
of the second-order perturbation theory developed in Sec. 29. 

The result of the calculation is that the cross section is very small 
unless the momentum vector of the incident electron is nearly parallel 
to the line that joins the two nuclei, and unless also the incident and final 
electron momenta are nearly parallel. These three directions can have 
an angular spread in radians that is of the order of the ratio of the wave 
length of the electron to the size of the atom. This is analogous to the 
result obtained above for the inelastic collision of a fast electron with a 
hydrogen atom: the angular spread of the scattered electron was found 
to be roughly l/froao- It is also in agreement with the wave-packet 
description of the process, since a localization of the electron by an atomic 
size a in a direction transverse to its motion produces an uncertainty in 
the transverse momentum component of amount h/a and an angular 
spread of order h/ap ^ l/froot. 

Formulation of the Problem. The nucleus of the first atom can 
without loss of generality be placed at the origin, and that of the second 
atom at the point R. The two atoms are assumed to be far enough apart 
so that the interaction between them can be neglected. The unperturbed 
Hamiltonian is then the sum of the kinetic energy of the incident electron 
and the unperturbed Hamiltonians of the two atoms. The perturbation 
is the sum of the interaction H[ between the incident electron and the 
first atom, and the interaction H 2 between the electron and the second 
atom. In the initial state, both atoms are in their ground states Uo with 
energies € 0 , and the incident electron has the propagation vector ko. In 
the final state, the first atom is in the state Un with energy cn, the second 
is in the state iim with energy 6m, and the propagation vector of the elec- 
tron is k„m- 

It is apparent that the transition in which we are interested cannot 
occur in first order. It can occur in second order, and there are two 
groups of intermediate states. In the first group, the first atom is in the 
state Un, the second in the state Uo, and the incident electron has some 
propagation vector k^o. In the second group, the fimt atom is in the state 
Uo, the second in the state Um, and the propagation vector of the electron is 
called kom. Thus the second-order matrix element (29.20) is 


EqQ SJnO Eqo — Eq 


)nm,00 

Om 


P - 2 . 4- 
E,, - 260 + 


E, 


Qm 


EnO = €n + eo + 




nO 


2m 


€0 + €m + 


2m 


(30.12) 
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"We evaluate only the first sum in (30.12) explicitly and indicate the 
ch-«"Uges that are to be made in the result to obtain the second sum. The 
iriatirix elements that appear there are 

{HD nm,nQ 

== exp (— iknm * x)H[{2,x)uq{2) exp (ikno • x)dr^T , 

(i^Z'Dno.oo 

= L”®//^n(l) exp (— ikno * r')'^/l(l,r')wo(l) exp (iko • r')c^ridT' 


1 and 2 denote all the internal coordinates of the first and second 
atoms, respectively, and rfn and dr 2 are the corresponding elements of 
integration; r and r' are variables of integration with volume elements 
dr and dr', both of which specify the position of the incident electron with 
respect to the origin. An integral over 1 has been performed in the first 
of Eqs. (30.13) to give unity, and a similar integral over 2 has been per- 
f ox'med in the second equation. 

Evaluation of the k Sum. When the matrix elements (30.13) are 
sixlbstituted into the fii’st sum of (30.12) and the sum and integrals are 
int»erchanged, we must evaluate 



exp ?kn 


AI0 



/C- = hi “ ^ (€n - €0) 


(30.14) 


For a box of sufficiently large dimensions L, we can replace the summa- 
tion in (30.14) by an integration: 


L\ f exp *nn • (r “ x') 


(30.15) 


Tlie integriil in (30.15) luus the form of that which appeared in the Greenes 
fvinction (20.12) for a free particle. Thus the only new problem pre- 
sented by evaluation of (30.15) is the determination of the contour 
ttLCbt takes i)r()per account of the singularity of the integrand at kno = k. 

This singularity is of the type discussed near the end of the last section 
arid arises^ from the possibility of first-order transitions in which just 
orio of the atoms is excited, Eejuation (29.24) shows that the proper 
contour to use for tlu^ integral over the magnitude kno of the vector kno is 
oiio that go(^s from 0 to + co by passing under the real axis at the pole/c. 
Afber the angular intc^gnitions are performed, the integrand is even in 
A" rtf), and tlufs <'.ontour run bo reflected in the origin. The resulting con- 
toiir, whi(;h exf.ends from — 00 to +co^ is exactly the same as that used 
in "the evaluaiion of (20.13) and shown in Fig. 21a. We therefore obtain 
from the (3arlier work an explicit expression for the summation (30.14): 


‘l7r[r 


— r' 


exp ?'/c|r — r'l 


(30.16) 
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Second-order Matrix Element. It is convenient to rewrite the matrix 
elements (30.13) in terms of new functions 

F„(r - R) = ;u„(2)JIJ(2,r)uo(2)dr2 
F„(r') = /u„(l)Hi(l,r')wo(l)dri 

These functions are very small except when their arguments differ from 
zero by distances of the order of the size of the atom. We put 

r" = r - R, 

so that practically all the contribution to the firat summation of (30.12) 
comes from small values of r' and r". We can then approximate 

P ■ r" P • r' 

|r-r'l = lR + r"-r'|^S+^^-^^ 

|r - r'|-i ^ 

to obtain the leading terms in (30. 16) for large R. The first summation of 
(30.12) then becomes, to this approximation, 

V . ) _ -g J ^■„(r')[exp f(k„ - h) • r']dr' 

• J /'’m(r")[exp i(tc - k„,„) • r"]r/r" (30.17) 

where k is a vector of magnitude k given by (30.14), tluil luis the direcition 
of R. 

In similar fashion, the second summation of (30.12) Ix^toines 
_ ^ ^^ expf(K^ + ko) - R j 

• j ^’m(r")[exp f(k„ + k') • r"]r/T" (30.18) 

where k' is a vector in the direction of R whose magnitude' is given hy 
(30.14) with cn replaced by 

The differential cross section is obtained by substitiil.ing the sum of 
(30.17) and (30.18) for I-I 21 in (30.6), and replacing A* by /cnm. C'Oii- 
servation of energy requires tliat 

- ^ (*» + - 2eo) 


Discussion of the Cross Section. The integrals that appear in (30.17) 
and (30,18) have the characteristic structure associated with the p(‘r- 
turbation treatment of collision problems. They are very small unless 



Sec. 31] 


METHODS FOR TIME-DERENDENT PROBLEMS 


207 


the propagation vector that appears in the exponent of the integrand has 
a magnitude that is of order 1 /a or less, where a is a typical linear dimen- 
sion of the atom {F significantly different from zero). It follows that 
(30.17) is significant only when the vectors ko, k, and knw are nearly equal 
in magnitude and direction. Because of the assumption that the inci- 
dent electron is fast, the magnitudes are very nearly equal in any event. 
Then the cross section is appreciable only when the vectora R and k„m are 
nearly parallel to ko. The permitted angular deviation from parallelism 
is easily seen to be of order l/k^a. 

In similar fashion, it follows that (30.18) is significant only when 
k', and hence R, is nearly antiparallel to both ko and k„„i, in which case 
the latter two vectoi's are nearly parallel to each other. 

The two terms together show that excitation of both atoms occurs 
with appreciable probability only when the line joining the two atoms is 
nearly parallel to the direction of the incident electron. It is apparent 
also that the cross section falls off invei’sely as the square of the distance 
R between the two atoms, lis would be expected. 

31. ADIABATIC AND SUDDEN APPROXIMATIONS 
In this section we develop approximation methods that involve the 
rate of change of the Hamiltonian, rather than the magnitude of the time- 
dependent part of the Hamiltonian. If the Hamiltonian changes very 
slowly with the time, we expect to be able to approximate solutions of 
the Schrodinger ccination by means of stationary energy eigenfunctions 
of the instantaneous Hamiltonian, so that a partic.ular eigenfunction at 
one time goes over continuously into an eigenfunction at a later time 
(adiabatic approximation). If the Hamiltonian changes from one steady 
form to anotluM* over a very short l.iine interval, we expect that the wave 
function (hx^s not (change miK^h, although tlio expansion of this function 
in eigenfuiK^tions of tlin initial and final Hamiltonians may be quite dif- 
ferent (sudden api)roximatiou). We (hd, ermine lu^re to what extent both 
of these ty]xis of approximation are valid. 

Expansion in Instantaneous Energy Eigenfunctions. We consider 
fii-st the a<lial)a(.ic approximation, and wish to solve tlie Schrodinger 
e(iuation 

ih’^ = nm (31.1) 

(ft 

when //(/) varies slowly with the time.' Tho solutions of the energy 
eigenvalue equation at each instant of time^ are assumed to be known. 

mOUnit) = En{l)llu{l) (31.2) 

1 1\I. Horc hikI V. Zrits. f. P/iyuik^ 61, ir)5 (1028); P. CUUtinRcr, Zeiis. f. 

Physik, 73, 1(>0 (1031). 
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We assume also that the Un are orthonormal, nondegenerate, and dis- 
crete; their phases are fixed below. 

Suppose that the wave function is known at zero time; at later times 
we put 

exp - I E„indt^ (31.3) 

n 

Substitution of (31.3) into (31.1) gives 
ih ^ ^ ~ ^ UnUnE^ exp | ^ j 

i-ii: 

Since Hun = EnUn from (31.2), the last term on the left side cancels the 
right side. Multiplying through on the left by Uh and integrating over 
all the coordinates of the system (Jdr), we obtain 

dft = - ^ a„ exp (E, - E„)dt'^ ■ j dr (31.4) 

n 

We now seek an expression for the integral on the right side of Eq. 
(31.4) that is easier to interpret in physical terms. Differentiation of 
(31.2) with respect to t gives 


= a„u„ 



4. TT 


^En \ rp 


Multiplying through on the left by and integrating over the coordi- 
nates gives 

J iZk~u,4r+ j UkH ^ dr = En J u, dr, k w (31 .5) 

We make use of Eq. (22.10) to rewrite the second integral on the left 
side of Eq. (31.5), and remember that H is Hermitian. 

- f 

Substitution into (31.5) gives an expression for the integral on the right 
side of (31.4) 



n k (31.6) 
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Choice of Phases. In order to rewrite Eq. (31.4) along the lines 
indicated in the last paragraph, we must have an expression for 

We now show that this integral is pure imaginary, and that a proper 
choice of the dependence of the phase of Un on t makes it zero. Differen- 
tiation of the normalization integral for Un gives 

0=1^/ + I dr 

Since the two integrals on the right are complex conjugates of each other, 
each must be pure imaginai’y: iunidUn/dt)dT = icx{t). 

We now change the phase of Un by an amount 7 ( 0 , which is permissible 
since the phases of the eigenfunctions are arbitrary at each instant of 
time. For the new eigenfunction ~ 

/ ^ ^ (.u„e^)dT = ia{t) + i 7(0 (31.7) 

Thus the choice t(0 = oL{t*)dt' for the phase makes the integral on 

the left side of (31.7) vanish. In what follows, we assume that has 
been substituted for Um with a consequent change in Eqs. (31.6), and 
omit the primes. 

We adopt our earlier notation = Eh — and substitute (31.6) 
into (31.4); 

fi 

where the prime on the summation indicates that the term n = k is 
excluded from the summation. The last term on the right side of Eq. 
(31.8) is the Icn matrix element of dll/dt. 

Adiabatic Approximation. ^Pho group of Iil(;(s. (31.8) for all fc's is 
exactly equivalent to the Schrddinger ociuation (31 .1). We now estimate 
the order of magnitude of ajt by assuming that all the (piantities 
(an, cojbn, Uny dll/dt) that appear on the right side of (31.8) are constant in 
time. If further we assume that the system is in the state m at t = 0, 
we can put an = 5nm. We thus ol)tain 



k 9^ m 
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which is readily integrated to give 


k 9 ^ m (31.9) 


With the above approximations, Eq. (31.9) shows that the probability 
amplitude for a state other than the initial state oscillates in time and 
has no steady change over long periods of time even though H changes 
by a finite amount. If the change in H during the Bohr period for the 
transition m— is small in comparison with the energy difference 
between these two states, the transition is unlikely to occur. The change 
in amplitude of the state k after a long time is of the order of the ratio of 
these two energies. 




(iM^)(aH/a0 j 

Eh — Em 


(31.10) 


Coimection with Perturbation Theory. An exceptional situation 
arises when the hlamiltonian oscillates in time with a frequency nearly 
equal to one of the transition frequencies, say This is a case of 
resonance, and we expect from the discussion of Sec. 29 that even a very 
small change in H can produce appreciable changes in the amplitude a* 
over long periods of time, so that (31.10) is not valid. It is then no 
longer permissible to assume that the time dependence of dH/dt can be 
neglected, and^the passage from (31.8) to (31.9) is not justified. 

In order to consider this case more carefully, we assume that only a 
small part of H oscillates in time with an angular frequency o; that is 
close to Oihm' 

f)TI 

H = Hq + 7?' sin cx)tj — = wH' cos oif, 


where H' is small in comparison with Hq and both of these are constant in 
time. If then the dependence of an, coA,n, and on time is neglected and 
we put an = 5nm as before, Eq. (31.8) becomes 


ah ^ 


cos <ot 

rjt 

2hcJkm ^ 


This is readily integrated to give 




^k7n 




'Am 




(ufcw-hw)/ 




“h CO 




O^km — CO 


(31.11) 


This shows tliat the adiabatic approximation (31.10) breaks doAvn for 
Wftm = ±w, since then (31.11) increases steadily with the time. Tf is 
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close to +w, the first bracket term can be neglected and we can replace 
cj/wjfcm outside the bracket by +1 ; if o^km. is close to — w, the second bracket 
term can be neglected and we can replace by —1. In both cases, 
we see that (31.11) agrees with the perturbation-theory result given in Eq. 
(29.17). 

Discontinuous Change in H. As an introduction to the sudden 
approximation, we consider first a situation in which the Hamiltonian 
changes discontinuously from one form that is constant in time to another. 
Suppose that J? = Ho for ^ < OandH = Hi fori > 0, where 

Ho^n ~ HlVm, “ 

and the and are complete orthonormal sets of functions that are 
not necessarily discrete. The general solutions can be written 


- 

_ iE-nt 

= S^mVme * 


t < 0 
t > 0 


(31.12) 


where the a’s and ?/s are independent of the time. 

Since the wave equation (31.1) is of first order in the time, the wave 
function at all points in space must be a continuous function of the time 
at i = 0, although its time derivative is not. The 6’s are then readily 
expressed in terms of the o’s by equating the two solutions (31.12) at 
t = 0, multiplying by a particular t), and integrating over the coordinates: 

hm “ (31.13) 


The appearance of final states m that need not have the same energy as 
an initial state is a conseciuenco of the non-zei’o frequency Fourier com- 
ponents into which the suddenly changing Hamiltonian can be resolved 
(see Sec. 29). 

Sudden Approximation. The suddtm approximation consists in using 
10(is.(3l.l3) when the change in the Hamiltonian occnipies a very short 
hut finite interval of time /n. In order to mske an csi.imate of the error 
introduced in b,nj we consider* a problem that, while somewhat artificial, 
can (uisily bo solved formally. Suppose that H = Jf[) for ^ < 0, // = Hi 
for / > /n, and II = //< for 0 < / < ^o. The intermediate Hamiltonian, 
wliich is assumed to be constant in time, has a complete orthonormal set 
of (uiergy eigenfunctions; 

HiWk = 

The true solution c.an be expanded in terms of the w's with constant 
coefficients : 


^ = S^'kWkr- 


iEkt 

ii 


0 <t < if, 
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The continuity condition at ^ = 0 gives 

C]o = (31.14) 

In similar fashion, the continuity condition at i = io gives, with the help 
of (31.14), 

6m = • e * 

_ \{Ek — Em)U 

= SkSo’O-n^iUmUndTS^'nWtfiT • 6 * 

__ i(JSk—Evi)h 

= * '\undrdT' (31.16) 

where the primes denote a different set of coordinate variables of integra- 
tion. The closure relation (10.11) shows that the bracket in the last 
term of (31.15) becomes a product of 5 functions of the differences 
between primed and unprimed coordinates when = 0, in which case 
the expression for hm agrees with (31.13), as it should. 

The difference between the exact expression (31.15) for hm and the 
approximate expression (31.13) is measured by the difference between 
exp [^i{Ek — Ert^U/h] and unity. . This is small if io is small in compari- 
son with all the periods h/ (Eh, — Em) that correspond to the eigenfunc- 
tions k and m that appear when H changes. 

A useful validity criterion is that to be small in comparison with the 
periods associated with the initial motion, since new states of motion that 
have very much shorter periods (high energies) are excited with relatively 
small amplitudes. When the sudden approximation is useful, the error 
in bm (and hence in ^) is of the order of the ratio of U to a typical initial 
period. 

Transient Disturbance, An interesting special case of (31.1 6) is that 
in which initial and final Hamiltonians are the same {Hi = i?o, = Um) 
and the S3rstem is initially in a particular state n. Then if to is short 
enough to satisfy the validity criterion of the last paragraph, we can 
expand the exponential in the last member of (31.15) and retain only the 
first two terms. 


h 

I'm =: 


J j u'^Skw't,idk 

1 - {E, - E„) 

j J KSkw'kWk 

1 1 

1 

1 

r-1 


Un(h(lr' 


Und'^dr^ 


With the help of the closure relation, the orthogonality of u,n and w„ when 
m ^ n, the substitution E^Ur,, = HoUmf and Eq. (22.10), this can be 
reduced to 
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tlfn(^Hi H Q^tlnldT j Tth 71 ^31.10) 

Equation (31.16) can be generalized to a situation in which Hi depends 
on the time; in this case a result that is also correct to first order in to 

is obtained by replacing Hito by H4t. 

It should be noted that the expression (31.16) for bm can be useful 
even when Hi — Hq not small in comparison with Ho, provided that 
the general criterion for the validity of the sudden approximation is 
satisfied {to sufficiently small). On the other hand, the perturbation 
theory (Sec. 29) is useful when a small time-dependent addition to the 
Hamiltonian is applied for a long time. 

Disturbance of an Oscillator. As a simple example of the application 
of the approximation methods developed in this section, we consider a 
linear hannonic oscillator in which the position of the equilibrium point 
a{t) depends on the time. The Hamiltonian for this system is 

The instantaneous energy eigenfunctions are the harmonic-oscillator wave 
functions (13.13) centered at the point a(0, and the energy levels are 
unchanged : 

Unix) = NnHn[oiix — a )]^- En = (n + i)hcjio 

We suppose first that the equilibrium point moves slowly, and 
investigate the circjumstanccs under which the adiabatic approximation is 
applicable. If the oscillator is initially in its ground state (/i = 0), the 
time derivative of the Hamiltonian dll/dt = — Kix — a)d has a non- 
vanishing matrix element only with the first excited state. With the 
help of (13.18) this is found to be 

Substitution into (31.9) shows that the coeflicierit of the time-dependent 
factor in the amplitude of the first excited state has the magnitude 

^ (ift)4 a 

hcol {Km)^ i2ho)c/ m)^ 

This expression may be interpreted physically by noting that the denomi- 
nator is of the order of the maximum speed of a hypothetical classical 
oscillator that has the zero-point energy. Thus the adiabatic approxima- 
tion is good if the equilibrium point moves slowly in comparison with the 
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claasical-oscillator speed. It is easily seen that for the nth excited state, 
the equilibrium-point speed must be small in comparison with 1/n times 
the corresponding classical-oscillator speed. 

The sudden approximation can be applied to an oscillator in its ground 
state when the time required to move the equilibrium point from one 
steady position to another is small in comparison with l/ojc. If this point 
is displaced a distance a in the positive direction along the line of motion, 
application of (31.13) shows that the probability amplitude for the nth 
state after the displacement is 

— / Unix — a)er^^*^dx = ^ / 

TTj-. J -tc 

This integral is identical with the expression for An in Eq. (13.21), except 
for the sign of a, and has already been evaluated with the help of the 
generating function (13.10) for the Hermite polynomials. The earlier 
discussion (Sec. 13) shows that the states most likely to be excited are 
those that have a classical amplitude of oscillation that is of the order of 
the displacement a; this is in agreement with the corresponding classi(^al 
result. 

Problems 

1. A hydrogen atom in its ground state is pliiecd b(^tw(;eji th(5 plates of a eoiidensor. 
A voltage pulse is applied to the eoiidenser so as to prodiuio a hoitiogc^neouH oleetrie 

_ t 

field that has the time dependence: E = 0, i < 0; E = EoC t, t > 0.^ Kind the first- 
order probability that the atom is in the 2*Sf state (200) after a long tinu^ What 
is the corresponding probability that it is in one of the 2P staU'S? 

2. An altcnuiting voltage of angular Frecpieney o) > mv^/2h^ is applied to the 
condenser of Prob. 1, What is the probability per unit time for the hydrogen atom 
to make a transition from its normal stat(^ to an ionized state? AssnnK*, only for the 
purpose of this problem, that the olecdronic wave function for th(^ ionizc^d stab^ can be 
represented by a plane wave. 

3 . Kxtend Eq. (211.20) to the (uise in which a transition (^an examr oidy in third 
order of the perturbation. Assume that noiK^ of the intermediate stjites Inis the same 
energy as the initial and final stab^s. 

4 . Use the perturbation theory to calculate the di(T(M‘cutiaI colli.sion cross .s('ction 
for the US' — ► 2*Sf excitation of a hydrogen atom. IiitegraU^ this to obtain tlu^ total 
(!rosa section, and show that it becomes the expre^ssion giv(m in Sc'c-. 30 at high bom- 
barding energy. 

6. Use the perturbation tlieuiy to calculate the difTercntial (a)llisiou (U’oss section 
for the LS — > 2P excitation of a hydrogen atom. Show that tlui total (U'oss section 
becomes the expression given in Eq. (30.1 1) at high bombarding mua-gy. 

6. Discuss the statement that appears at the end of the next to the last paragraph 
of Sec. 31. In parthnilar, show physically why it need not lx* a sunhient condition 
for the applicability of the adiabatic approximation that the ecpiilibrium-point speed 
be small in comparison with the corresponding elassical-oscullator speed. 
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7. Under what circumstances is ^(0 =■ [exp a valid representation 

of the solution xp at time t in terms of the Hamiltonian H and the solution at time 
zero? Show that, in general, the operator Ht in the exponent cannot be replaced by 

Hdt\ Show, however, that HiU in Eq. (31.16) can be replaced hy Hidt to first 
order in U. 

8. A hydrogen nucleus of mass 3 is radioactive, and changes into a helium nucleus 
of moss 3 with the emission of an electron that has about 17,000 electron-volts energy. 
vShow that the sudden approximation can be applied to the extranuclear electron that is 
initially present in the hydrogen atom. Calculate the numerical values of the prob- 
abilities that the resulting helium ion is found in its l^Sf, 2S, and 2P states if the hydro- 
gen atom is initially in its IS state. 



CHAPTER IX 

IDENTICAL PARTICLES AND SPIN 


The quantum-mechanical theory of particles presented thus far is 
deficient in three respects. First, whenever two or more particles are 
described at once, like the electron and proton of the hydrogen atom 
(Sec. 16) or the incident and atomic electrons in an inelastic collision 
(Sec. 30), it is assumed that the particles can be distinguished from each 
other. This is .a valid assumption in the first example, since electrons 
and protons possess quite different masses and electrical charges. In 
the second example, however, there is no obseiwable difference between 
the incident and atomic electrons, and the consequences of this identity 
should appear in the formalism. The second defect of the theory is the 
omission of an intrinsic spin angular momentum^ or spm, which is actually 
possessed by the particles found in nature.^ Third, no mention has as 
yet been made of the special theory of relativity, wliich is expected to 
affect the theoretical description of particles that move with speeds close 
to that of light. 

The ways in which the first two of these defects can he remedied arc 
described in this chapter, and illustrative examples are discussed. 
Relativistic effects are taken up in C^hap. XII. 

32. IDENTICAL PARTICLES 

Identical particles cannot be distinguished by means of any inherent 
property, since otherwise they would not be identical in all respects. 
In classical mechanics, the existence of sharply definable trajectories for 
individual particles makes it possible in principle to disi iiiguish between 
particles that are identical except for their paths, siiu^e (ui(*h particle <^an 
be followed during the course of an experiment. In (luani-iini mechanics, 
the finite size and the spreading of the wave pac^kets t-hat can (lcs(Tihe 
individual particles often make it impossilile to dislinguish between 
identical particles because of their positions, espocdally if they interact 
with each other to an apprecuable extent. This is t.riu^ of the electrons 
in a single atom, where we have seen that the (l(^sc.ri|)tioii in terms of 
moving wave packets breaks down completely. 1 I()W(^v(M’, t.h(‘ electrons 
of different atoms that are well separated from eiivh oth(n* may, to good 

^ Spin was first diHOoverod in connection with electrons, by Cl. M. Uhlcnbe<'k untl 
S, Goudsmit, Naturwiaa.j 13, 963 (1925); Nature^ 117, 264 (1926). 
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approximation, be regarded os distinguishable. This section considers 
some of the effects of identity on the quantum-mechanical treatment of 
systems of two or more particles. Other effects that involve the spin 
explicitly will be taken up in the remainder of this chapter. 

Physical Meaning of Identity. The impossibility in principle of dis- 
tinguishing between identical particles in most quantum-mechanical 
problems can lead to effects that have no classical analogue. As an 
example, we compare the elastic collision of two identical particles that 
have a particular interaction between them, with the collision of two 
different particles that have the same interaction between them. 

In a classical treatment, there is no difference of principle between the 
results of these two experiments, since it is possible to distinguish between 
the incident and struck particles in the first case as well as in the second. 
In practice, however, this distinction would usually be made only in the 
second experiment. Thus, according to classical mechanics, the meas- 
ured differential cross section in the first experiment is equal to the sum 
of the corresponding cross sections measured for the incident and struck 
particles in the second experiment. In the corresponding quantum- 
mechanical situation, the identical particles in the first experiment cannot 
be distinguished by means of their trajectories, since they cannot be well 
localized without interfering with the scattering process. Thus the dis- 
tinction between incident and struck parti(‘iles has no physical significance, 
and the simple connection between the results of the two experiments that 
is found in the classical case need not exist. 

We use the word idmtical to describe particles that can be sub- 
stituted for each other under the most general possible circumstances 
with no change in the physical situation. Identical particles can in some 
cases be distinguished from eatih other, as when their wave packets do 
not overlap. Another ciise, discussed more fully in Sec. 33, arises when 
each of the particles possesses an intrinsic spin angular momentum, 
which is a constant of the motion in a particular collision. Then since 
the component of the spin along some axis is assumed not to change dur- 
ing this collision, the particles c.an be distinguished if they have different 
spin components, iiosults of this kind must, of course, be a consequence 
of the formalism tliat we now set up. 

Symmetric and Antisymmetric Wave Functions. The Schrodinger 
wave eciuation for n identic, al particles is 

. . . ,n-t) = 7/(1, 2, . . . . . . ,n; t) (32.1) 

where each of the numbers represents all the coordinates (positional and 
spin) of one of the particles. The Ilamiltoiiian H is symmetrical in its 
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arguments, since the identity of the particles means that they can be 
substituted for each other without changing H, 

There are two kinds of solutions ^ of Eq. (32.1) that possess symmetiy 
properties of particular interest. A wave function is symmetric if the 
interchange of any pair of particles among its arguments leaves the wave 
function unchanged. A wave function is antisymmetric if the interchange 
of any pair of particles changes the sign of i/'. We now show that the 
symmetry character of a wave function does not change in time. If 

is symmetric at a particular time then H\ps is also symmetric, and 
(32.1) states that d-^n/dt is symmetric. Since ^p8 and its time derivative 
are symmetric at time i, -ps at an infinitesimally later time i + d/ is given 
by ^5 + {d\ffs/dt)dtf and is also symmetric. Such a step-by-step integra- 
tion of the wave equation can, in principle, be continued for arbitrarily 
large time intervals, and \[/a is seen to remain symmetric always. In 
similar fashion, if xJ/a is antisymmetric at any time, and hence 
dxl/A/dt are antisymmetric, and integration of the wave eciuation shows 
that \I/A is always antisymmetric. 

The foregoing proof is not altered if H and ^ have as their arguments 
the coordinates of two or more different groups of idonticnl particles; 
thus a wave function that is initially set up to be symmetric or anti- 
symmetric in the coordinates of each identical-particle group always 
retains this character. This makes it possible for the dilTerent groups of 
identical particles found in nature to have definite symmetry cliai’actei'S, 
and this is actually found to be the case. Electrons, protons, and neu- 
trons are the only material particles for which the experimontal evidcmcui 
is unambiguous, and each of these kinds of parti(!les is described by anti- 
symmetric wave functions. 

Construction from Unsymmetrized Functions. We now show how 
\l/8 or xpA can be constructed from a general unsyinm(itriz(ul solution i// 
of Eq. (32.1). If the arguments of ^ are permuted in any way, the result- 
ing function is a solution of (32.1). 'J’hat this is true follows from th(^ 
observation that the same permutation applied throughout E(|. (32.1) 
does not impair its validity, sim^e it (corresponds simply to a nhibeling 
of the particles; then since II is symmetric,, the permul(Ml II is the same 
as the original H, and the resulting ccpiation is the same ns (32.1) for 
the permuted i^. In this way n! solutions can lie obtained from any one 
solution, each of which corresponds to one of the /t! permut at ions of the n 
arguments of xp. It is evident that any linear combination of these fumv 
tions is also a solution of the wave ecpiation (32.1). 

The sum of all these functions that are linearly indcqicndeiit is a sym- 
metric (unnormalized) wave function xpa, sin(^e the int.(M'(duinge of any 
pair of particles changes any one of the component functions into anotluu’ 
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of them and the latter into the former, leaving the entire wave function 
unchanged. An antisymmetric unnormalized wave function can be con- 
structed by adding together all the permuted functions that arise from 
the original solution by means of an even number of interchanges of pairs 
of particles, and subtracting the sum of all the permuted functions that 
arise by means of an odd number of interchanges of pairs of particles in 
the original solution. It is apparent that a nonvanishing antisymmetric 
wave function cannot be formed from a solution that is unaltered by the 
interchange of any pair of particles. 

In the event that the Hamiltonian does not involve the time, station- 

ary solutions 1 ^( 1 , 2, . . . ,n; t) = ^^(1,2, . . . ,n)e ^ can be found, 
where 

[if(l,2, . . . ,n) - A>.(1,2, . . . ,n) = 0 

The earlier discu>ssion shows that the solutions derived from any u by 
means of permutations of its arguments are degenerate with the original 
u] this is called exchange degeneracy. When = 2, the 21 = 2 permuta- 
tions result in w(l,2) and the symmetric and antisymmetric com- 

binations are obtained by taking the upper and lower sign, respectively, in 

7^1,2) + 74(2,1) (32.2) 

When 11 = 3, the 3! = 0 permutations yield 74(1,2,3), 14(2,1,3), 7i(3,2,l), 
74(1,3,2), 74(2,3,1), and 74(3,1,2); the symmeti'ic- and antisymmetric com- 
binations are 

[74(1,2,3) + 74(2,3,1) + 74(3,1,2)] 

± [74(2,1,3) + 7/(1, 3, 2) + 74(3,2,1)] (32.3) 

with the upper and lower sign, respectively. 

All th(‘ energy eigenfun/tions that ai’o exciiango degenerate with 
74(1,2) can he formed from the two solutions (32.2). When n = 3, 
however, there are four liiioaiiy indc^peudent eigenfunclions that cannot be 
formed from the two functions (32.3). Tliese additional solutions, which 
always appear when n > 2, can he chosen so as to possess permanent 
symmetiy characters that resemble hut are somcuvhat more complicated 
than those of the symmetric and antlsymmeti’i(; solutions. However, 
they do not appear to descrihe partich^s found in nature. 

Distinguishability of Identical Particles. It is to be expected that 
the result of an experiment is independc^nt of the symmetry (iiameter of 
the wave funcl-ion if tlie (coordinates of tlie pariiccles do not ovcciiap. This 
corresponds to a situation in which tlu' particles can he distinguished by 
inccans of theeir positions (or ilu'ir s])in (components) eveen though tlicy are 
identical. Such a situation impli(cs, in th(c {cas(c of two parti(cles, that the 
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wave function u(l,2) is different from zero only when the coordinate 1 is 
in some region A, the coordinate 2 is in a region B, and A and B have no 
common domain. 

The coordinate probability density associated with the wave function 
m( 1,2) is |m( 1,2)1* and the densities associated with the symmetrized 
wave functions (32.3) are 

lw(l,2) ± m( 2,1)P = |w(l,2)l* + !m( 2,1)1* + 2fi:e[M(l,2)M(2,l)] (32.4) 

where Re denotes the real part of the expression in brackets. If now 
w(l,2) vanishes whenever 1 is not in A and 2 is not in B, and A and B do 
not overlap, the bracket term is zero everywhere, and (32.4) becomes 
lii(l,2)|= + |u(2,l)l^ 

Thus the density associated with either of the ssmimetrized wave 
functions (32.2) is the sum of the densities associated with w(l,2) and 
m( 2,1) separately. This is precisely the result that will be obtained if 
the particles are not identical but no attempt is made to distinguish 
between them in performing the experiment. Thus the interference 
effects between exchange-degenerate wave functions, represented by the 
bracket term in (32.4), disappear when the coordinates of the particles 
do not overlap. 

The Exclusion Principle. In many problems, a useful zero-order 
approximation can be obtained by neglecting the interactions between the 
particles that make up the system under consideration. The approxi- 
mate (unperturbed) Hamiltonian is the sum of equal Hamiltonian func- 
tions for the separate particles 

Hod, 2, . ■ ■ ,n) = HS(1) -t- HS(2) -h • ■ • + Kin) (32.6) 
and the approximate energy eigenfunction is a product of ono-particl(J 
eigenfunctions of HJ 

m( 1,2, . . . ,n) = !;„(l)Dfl(2) . . . »>,(w) 

E = Ea + Ef+ ■ • • + E, (32.(5) 

maM) = EavM, etc. 


If the particles are electrons, an antisymmetric, wave f uiudion must be 
constructed from the u given by (32.6). This is most oiisily expressed 
as a determinant of the a’s: 


Ui(l,2, . . . n) = 


«<.(!) 

v.m 

■ ■ • vAn) 

r^(l) 

V0(2) 

■ ■ ■ 

v,(l) 

v,{2) 

■ ■ ■ v,(n) 


(32.7) 


The (unnormalized) Ua given in (32.7) is clearly an antisymmetric solu- 
tion of the approximate wave equation (Ho — E)ua = 0. 
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Equation (32.7) has the interesting property that it vanishes if two 
or more of the i^'s are the same. This is a special case of the general 
result stated earlier that an antisymmetric wave function cannot be 
constructed from a solution that is unaltered by the interchange of any 
pair of particles. Thus the approximate Hamiltonian Ho has no solu- 
tions for which there is more than one electron in any one of the states 
oLj Pj » . , j V. This result is known as the exclusion principle and was 
first postulated by Pauli ^ as an explanation of the periodic system of the 
chemical elements (see Sec. 38). 

Connection with Statistical Mechanics. The unsymmetrized zero- 
order solution given in Eq. (32.6) can be used to construct a symmetric 
as well as an antisymmetric wave function. Such a symmetric (unnor- 
malized) function is easily seen to be the sum of all different permutations 
of the numbers 1,2, . . . ,n among the one-particle eigenfunctions 
Vaj V 0 J . . , , This wave function is unique, and can be specified 
simply by stating how many particles are in each of the states a, . . . . 
In the same way, an antisymmetric wave function can be specified by 
stating how many particles are in each state. The fundamental statis- 
tical difference between particles that are described by antisymmetric 
and by symmetric wave functions is that the number of the former type 
that can occupy any state is limited to 0 or 1, whereas any number 
(0, 1,2, . . .) of the latter type of particles can occupy any state. 

The treatment of aggregates of large numbers of noninteracting (or 
weakly interacting) particles for which the states can be enumerated in 
these two ways forms the subject matter of quantum statistical mechanics. 
Particles that are described by antisymmetric wave functions are said to 
obey Fermi-Dirac statistics^ and particles that are described by symmetric 
wave functions obey Einstcin-Bose statistics/^ 

While all the material particles whose statistics arc definitely known 
(electrons, protons, and neutrons) obey Fermi-Dirac statistics, the Ein- 
stein-Bose statistics are not without physical interest. In the first place, 
light quanta or photons, in so far as they can bo treated as particles, obey 
Einstein-Bose statistics even though they cannot usefully be described by 
means of wave functions. Second, certain aggregates of particles that 
are sufficiently tightly bound so that they can be regarded as “particles,” 
are described by symmetric wave functions. 

For example, the nucleus of a helium atom is made up of two protons 
and two neutrons, which are strongly bounfl together. If we consider a 
number of helium nuclei that interact with each other weakly enough so 

1 W. Pauli, Zaiis, f, Phymk, 31, 7(56 (1926). 

* See, for example, U. C. Tolinan, ‘‘The PrinciplcH of SUitiatioftl Mechanics,” 
Chap. X (Oxford, New York, 1938). 
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that the changes in the internal motions of the nuclei can be neglected, 
we can see that the motions of the centers of gravity of the nuclei can be 
described approximately by a symmetric wave function. The inter- 
change of a pair of helium nuclei can be thought of as the resultant of 
four interchanges: two pairs of protons and two pairs of neutrons. Since 
the actual wave function is antisymmetric in all the protons and in all the 
neutrons, the resultant of these four interchanges leaves the approximate 
wave function unchanged. By an extension of this argument, we see 
that weakly interacting ^^particles” (nuclei, atoms, or molecules) obey 
Einstein-Bose statistics when each of them consists of an even total 
number of electrons, protons, and neutrons, and obey Fermi-Dirac 
statistics when each consists of an odd total number of these particles.^ 

Collisions of Identical Particles. When the only forces acting on two 
particles result from their mutual interaction, the over-all motion can be 
separated into motion of the center of mass of the two particles and 
motion of the particles relative to each other, as discussed in Secs. 16 and 
18. It is apparent that an interchange of two identical particles does not 
affect the position vector of the center of mass [which is i(ri + 12 ) since 
the particles have equal masses], but changes the sign of the relative 
position vector r (= ri — r 2 ). We postpone consideration of the spins of 
the particles until the next section, and see now what olTc(d. symmetry 
or antisymmetry of the space part of the wave function luis on the elastic 
scattering of a particle by another that is identical with it. 

The asymptotic form of the unsyrnmetrized scattering wave function 
in the center-of-mass coordinate system is given by Eq. (18.10). 

uir) ^ (32.8) 

r— > ao 

where Tydj<j) are the polar coordinates of the relative position vector r. 
Since the polar coordinates of the ve(;tor — r are r, tt — 0, <^ + tt, the 
asymptotic forms of the >symmetric and antisymmetric wave functions 
formed from (32.8) are given by 

(^WAa + [/(^,C?!>) ± /(tt — (^ + 7r)]r“'r'^^ (32.9) 

with upper and lower signs, respectively. 

From the discussion of Sec. 18, it follows that tlio differontial sc.atter- 
ing cross section in the center-of-injuss (‘.oordinain system is the scpiare of 
the magnitude of the bracket term in (32.9) : 

<T{dj<l>) = |/(0,<#>)|^ + IKtt — 6, <t> + 7r)l- 

± 2RcU{d,<l>)J(ir -e,<l> + tt )] ( 32 . 10 ) 

^ A more rigorouH treatment that leads to the same (conclusion has b(‘('n ^ivtui by 
P. Ehrenfest and J, 11. Qppenheimer, Phys, Kev.j 37, 333 (11)31). 
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The normalization adopted here can be justified by noticing that in the 
classical limit, where the identical particles are distinguishable and the 
last (interference) term in Eq. (32,10) drops out, <r{6j<t>) becomes just 
the sum of the differential cross sections for observation of the incident 
particle (|/(^,<#>)|’^) and of the struck particle (|/( 7 r — 0, <#> + 7r)p), as it 
should. 

In the usual case, for which / is independent of <i>, it is apparent that 
the scattering per unit solid angle is symmetrical about 6 = 90° in the 
center-of-mass ('coordinate system. It is easily seen from Eq. (18.7) with 
7 = 1 that the s(^attering per unit angle (not per unit solid angle) in 
the laboratory coordinate system 

(ro(0o) sin 00 = 4 cos 0o sin 0o {|/(20())|^ 

+ \f{T - 20o)|^ ± 2EcLf(20o)/(7r - 20o)]] 

is symmetrical about 00 = 45°. 

33. SPIN ANGULAR MOMENTUM 

The treatment of identical particles presented in the preceding section 
must now be supplemented by inclusion of the spin angular momenta of 
the particles. It was shown in Sec;, 24 that the operator M, which has 
the properties associated with an angular momentum, can be represented 
by matrices in an infinite number of ways. For each representation, 
M‘-* and one component of M, say can l)e diagonalized; their eigen- 
values arc j{j + l)/i‘'^ and the series jh, (i 1)^, . . . , — jh^ respec- 
tively, where 2j is zero or a positive int(;ger. If the expression for M 
in terms of I he position and momentum of a parti(;lc (r X p) is abandoned, 
then M- (^an c-ommuic; with the Hamiltonian for that, particle. In this 
case, M“, and hence' j, is a c.onstant of tlio mot ion and charac.terize.s the 
particle for all tinu^; the (;()rresp()nding intrinsic; angular momentum is 
(;all(^d the spin of tlic; i)arti(;le. We shall r(;pla(;e M by S and j by s in 
dealing with the spin. 

Connection between Spin and Statistics. All the material particles 
whose spin is d(;finit(;ly known (el(;(;tr()ns, protons, and neutrons) have 
,s = Aggregat(;s of particles that are siitlic.ient.ly tightly bound can 
be r(;giirded as ‘'parti(;l(;s,^^ and can be {;harac.t(;i'ized by d(;finite magni- 
tude's of lh(;ir total internal angular monumt-a, so long as thedr internal 
motions and tlu; r(;lativc spin orientalious of l-heir compcjmuit particles 
arc; not signifi(;ant.ly allectc'd by tlu; inl.(;ra(;t.ions b(;tw(;en aggregates. 
This is (;xa(;tly analogous t-o the situat.ioii with regard to the statisti(;s 
oh(;y(;d by the aggr(;gat(;s, (lisciiss(;d in the pr(;ceding section. 

The tr(‘at-ment of tlu; additiem (3f angular inom(;nta, presented at the 
end of Sec, 2L, can be generalized to give the ])ossible magnitudes of the 
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total internal angular momentum, which we call the spin, of any aggre- 
gate of fundamental particles. If the aggregate consists of n particles, 
each of which has 5 = i, and the internal orbital angular momentum of 
these particles is ignored, the total s can be any integer from 0 to if n 
is even, or can vary by integer steps from i to if n is odd. The total 
orbital-angular-momentum quantum number can be shown to be an 
integer or zero in general;^ its inclusion extends the maximum value of s 
for the aggregate, but does not alter the conclusion that s is zero or an 
integer if n is even, and is half an odd integer if n is odd. 

We see then that for the known fundamental particles and for aggre- 
gates of them that have a definite spin, there is a unique connection 
between the spin and the statistics. Particles or aggregates that have 
zero or integer spin are described by S3nnmetric wave functions and obey 
Einstein-Bose statistics, and particles or aggi’egates that have half-odd- 
integer spin are described by antisymmetric wave functions and obey 
Eermi-Dirac statistics. There is some theoretical reason, based on 
relativistic quantum mechanics, ^ to believe that this connection also 
holds for other fundamental particles whose existence is suspected but 
whose spin and statistics have not yet been determined (mesons, 
neutrinos). 

Spin Matrices and Eigenfunctions. The spin can be included in the 
formalism developed in Sec. 32 by having each of the numbers 1,2, . . . ,n 
that appear as the arguments of ^ and u represent a spin coordinate as 
well as the three space coordinates of that particle. The spin coordinate 
differs from the space coordinates in that it takes on only 2s + 1 values 
for a particle (or aggregate) of spin s, instead of the infinite number of * 
values that are taken on by each space coordinate. Thus the ^^spin 
space” consists of a finite number of points. The spin wave function of a 
single particle is completely determined by the specification of 2.s + 1 
numbers, whereas the space wave function involves the specification of a 
continuously infinite set of numbers (which is equivalent to a continuous 
function of the space coordinates). 

A convenient set of orthonormal one-particle spin functions is pro- 
vided by the normalized eigenfunctions of the and Mz matrices given 
in equations (24.15). These eigenfunctions are (25 + l)-row, one-column 
matrices that have zeros in all positions except one. For example, if 
5 = the four spin eigenfunctions are easily seen to be 

^ The work of Secs. 14 and 24 shows that this is true for nonintcracting particles 
that move in central force fields, and the result turns out not to be affected by particle 
interactions. 

* W. Pauli, Phys, Rev,, 68, 716 (1940). 
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and correspond to Sz eigenvalues of fft, ihj and — l-ft, respectively. 
The orthonormality is demonstrated by multiplying the Hermitian 
adjoint of one spin function into itself or another function 



(0 10 0 ) 


0, etc. 


with the help of the usual rule for matrix multiplication. 

Symmetric or antisymmetric many-particle wave functions can be 
constructed from unsymmetrized solutions that include the spin by fol- 
lowing the procedure outlined in the preceding section. It is sometimes 
convenient to choose the unsymmetrized solutions to be eigenfunctions 
of the square of the magnitude of the total spin of the identical particles 
(Si + S 2 + * • • + Sn)^ and of the z component of this total spin 
Si* + aSi 2 * + ■ • • + Snz- These quantities are constants of the motion 
if the Hamiltonian does not contain interaction terms between the spins 
and other angular momenta. In addition, such functions are often useful 
as zero-order wave functions when the spin interactions are weak enough 
to be regarded as a perturbation. There is no loss of generality in choos- 
ing the unsymmetrized solutions in this way, since in the absence of spin 
interactions any solution can be expressed os a linear combination of total- 
spin eigenfunctions. 

Collisions of Identical Particles. The effect of spin on the collision of 
two identical particles (or aggregates) can now be taken into account if 
the interaction between the particles docs not involve the spin. Since 
each particle has 2a + 1 spin eigenfunctions, there are altogether (2s + 1)® 
independent spin functions for the pair, each of which is a product of 
one-particle spin functions. 

Any (2s + 1)- linearly independent combinations of these products 
can be used in place of them. These are conveniently divided into three 
classes. The first class consists of products of one-particle functions in 
which both particles are in the same spin state with S, value mh: 

Vi{m)Vi{m), —s ^ m g s 

where the subscript specifics which of the particles is in each state; 
there are evidently 2s -1- 1 such states. The second chiss consists of 
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sums of products 

+ 2;i(m")v2(m'), m' m" 

There are s(2s + 1) of these states. The third class consists of dif- 
ferences of products 

Again there are s(2s -h 1) of these. 

The first two classes are clearly symmetric in an interchange of the 
spin coordinates of the two particles, and the third class is antisymmetric 
in such an interchange. Thus the total of (2s + 1)^ states can be divided 
into (s + l)(2s + 1) symmetric and s(2s + 1) antisymmetric states. 
Associated mth the symmetric spin states must be a symmetric space 
state if s is an integer (symmetric total wave function), and an anti- 
symmetric space state if s is half an odd integer (antisymmetric total 
wave function). Similarly, the antisymmetric spin states multiply 
antisymmetric space states if 25 is even, and multiply symmetric states 
if 2s is odd. We see then that if all the spin states are equally likely to 
appear in a collision,^ a fraction (5 + l)/(2s + 1) of the collisions will be 
described by the wave function (32.9) with the upper sign, and a fraction 
5/(2s + 1) will be described by (32.9) with the lower sign, if 2s is even. 

This and the similar result for 2s odd can be summarized by rewriting 
Eq. (32.10) 

<r(9) = 1/(5) 1^ + |/(x - 9)1“ + 2 Kc[/(9)7(t - 9)] (33.2) 

where / is assumed to be independent of 0. 

Equation (33.2) can also be derived by making use of the earlier 
observation that particles that have different spin components are dis- 
tinguishable, in which case the interference term in (32.10) disappear. 
This occurs in a fraction 2s/ (2s + 1) of the collisions. In the remaining 
fraction l/(2s + 1) of the collisions, the particles have the same spin 
component, and the symmetric or antisymmetric space state (upper or 
lower sign in the interference term) must be used according as 2s is even 
or odd. 

Electron Spin Functions. In the remainder of this chapter we con- 
sider only electron spin functions (s = ^). The spin matrices are given 
by the first line of Eq. (24.15), and may be written as S = where 

- C i)' ■ C ~o)' ■ C -) 


1 See footnote 1, pn^o 23C. 
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are called the Pauli spin matrices.^ The normalized eigenfunctions of Sz 
may be written in analogy with Eqs. (33.1) as 



and have eigenvalues and — respectively; they are both eigenfunc- 
tions of with the same eigenvalue 

Since we shall have occasion to write down products of spin functions 
for different electrons, it is convenient to abbreviate the notation as 
follows: 

== (H 1-+), etc. 


where the first particle has the eigenvalue ih for the second has the 
eigenvalue — for ^ 2 *, etc. Si has no effect on the spin functions of 
any but the firet particle. 

The following operational rules are easily obtained from (33.3) and 
(33.4): 


<J^»(+) “ (”), <7’t/(+) = 

<^x(“) = ( + ), 0’l/(”-) = — '^‘( + )» 


tra(+) = (+) 

<r«(-) = -(-) 


(33.5) 


There are four linearly independent spin functions for a pair of elec- 
trons: (++), (+—), (— +), ( )• These are orthonormal, since the 

one-particle spin functions (33.4) are orthonormal. As remarked earlier, 
it is often convenient to regroup these functions into combinations that 
are eigenfunctions of (Si + So)^ and Su + S 2 z. It can be verified with 
the help of (33.5) that the following four combiiiatioriH are orthonormal 
and have the indicated eigenvalues: 


(Si + S.,)’-* Su + S2. 


( + +) h 

‘^ ■*[(+“) + (-+)] 2fi^ 0 

(--) -h 

2'i|(H^-) - (-+)] 0 0 


(33.6) 


It is interesting to note that the first thrc^e of the two-particle spin 
functions (33.6) together behave in all rc^spects like a single “particle/’ 
of spin s = 1, and the last of the spin funct.ions (33.()) behaves like a 
single “particle” of spin s = 0.*-^ Nob only do they have the proper 
eigenvalues of the scpiare of tlie magnitude of ih(^ i.oiul spin and the z 
component of the total spin, l)ut the result of opcu'ating on the triplet 
spin func-tion with the x or y components of tlu^ 1 otal s]7in is in agreement 

1 W. Pauli, ZcilH. S Phymk, 43, 001 (1027). 

2 Tlu^ lirat thn'o hIhU'h aro (^alhul a iripld atul lh(^ last a nhiylct. lii tlu^ olil (|iianiuni 
theory, the trii)l(a eorre.spondH to parallel eliu't.roii spins and the siiiKlet to aiitiparallel 
aphiB. 
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with the corresponding matrices in the second line of Eq. (24.15). This 
provides an example of the addition of angular momenta; according to 
Sec. 24, the combination of two systems of angular momenta results in a 
system of angular momentum either 1 or 0. 

The HeUtim Atom. The ground state of the helium atom was con- 
sidered from the point of view of the variation method in Sec. 27. We 
now consider the ground and first excited states of helium with the help 
of the somewhat simpler first-order perturbation theory of Sec. 25; the 
symmetry effects of the spins of the two electrons are taken into account, 
although spin-dependent forces are neglected. We use products of 
hydrogenic wave functions Unim (with Z = 2) as the unperturbed eigen- 
functions of the problem, and are interested in classifying the states 
according to symmetry and spin properties rather than in obtaining 
accurate energy levels. 

In spectroscopic notation, the ground state of helium is the state: 
both electrons are in the hydrogenic state Wioo. Since this space state is 
symmetric, the spin state that multiplies it must be the antisymmetric 
singlet given as the last of the functions (33.6), for which the total spin 
is zero. 

The space part of the first excited state of helium is eightfold degener- 
ate in the zero-order approximation. The spectroscopic configurations 
are ls2s and ls2p. Apart from electron exchange, the first state is non- 
degenerate and the second is triply degenerate (because of the three 2p 
states); the exchange degeneracy doubles the number of states, since 
either electron can occupy the Is state and the other the 2s or 2p state. 
In order to simplify matters, we consider here only the doubly (exchange) 
degenerate ls2s state; it is not difficult to show that the ls2p states can 
be treated separately (see Prob. 7). 

The perturbation energy is the electrostatic repulsion between the 
electrons eV^i 2 , and the unperturbed states are Wi()()(ri)iA 2 oo(r 2 ) and 
'Wioo(r 2 )t^ 2 oo(ri). The spin need not be considered explicitly at this point 
since the spin-dependent forces are neglected; appropriate spin functions 
will be multiplied in later to make the entire wave function antisym- 
metric. The matrix of the perturbation for these two states has the 
structure of (25.16) and can be written 


where 



J = 
K = 


j j ‘Wioo(ri)u2oo(r2) 
J J Wioo(ri)W20o(r2) 


— 'Wioo(ri)i/2oo(r2)dridr2 
' 12 

— ‘WlOo(r2)tt20o(ri)dTidT2 


(33.7) 


(33.8) 


J is often called the direct or Coulomb energy^ and K the exchange energy. 
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Application of the diagonalization technique of Sec. 25 (see the treat- 
ment of the first-order Stark effect in hydrogen) shows that the eigen- 
values of the perturbation (33.7) are / + iC and J — K] they correspond 
to the normalized eigenfunctions 2 “*[wioo(ri)w 2 oo(r 2 ) + iAioo(r 2 )’W 2 oo(ri)] 
and 2“*[t/i()o(ri)u2oo(r2) — Wioo(r 2 )w 2 oo(ri)], respectively. Since the first 
of these is a symmetric space function, it must be multiplied by the anti- 
symmetric singlet spin function. Similarly the second, which is an anti- 
symmetric space function, must be multiplied by one of the symmetric 
spin functions that make up the triplet in (33.6). Since K turns out to be 
positive, the singlet spin state has a substantially higher energy than the 
triplet spin states. This is not due to a spin-rlependent interaction, but 
to a coupling between the spins and the electrostatic interaction that is 
introduced by the exclusion principle (use of antisymmetric wave 
functions). 

Spin Functions for Three Electrons. In the treatment of exchange 
scattering from helium that is given in the next section, we shall require 
eigenfunctions of the total spin of three electrons that are analogous to 
those given in Eqs. (33.6) for two electrons. We can regard three elec- 
trons as 1 + 2 electrons, in the sense that we can combine an electron 
(s = -g-) with the triplet two-electron function (s = 1) and with the singlet 
function (s = 0). In the first case, the results on addition of angular 
momenta, given in Sec. 24, show that we should get two groups of spin 
functions for the three electrons that correspond to s — i and s = f ; in 
the second case we should get a single group of three-electron spin func- 
tions that correspond to s = ■^. We thus expect one (juartet group of spin 
states (s = ^) and two distinct doublet groups of spin states (s = ^), 
or a total of 4 + 2 + 2 == 8 individual three-oloctron spin states. These 
must of course be expressible as linear combinations of the 2^ = 8 
products of one-electron spin functions. 

It is not difficult to show that the following eight combinations are 
orthonormal and have the indicated eigenvalues: 

(Si + Sa + S,)® 


(+ + +) 

■•."A* 

Ih 

3-‘[( + + -) + (+-+) + (- + +)] 



3-‘[(--+) + (- + -) + ( + --)] 

■'.fA* 


( ) 

•»/A* 

-\h 

6-*[( + + -) + (+-+) - 2(- + +)] 

3A« 

\h 

6“‘[(--+) + (- + -)- 2(+--)] 

JA» 


2-*[(+ + -) - ( + -+)] 

w 

ih 

2-»[(--+) - {- + -)! 

iA> 



The first four (quartet) states are symmetric in the interchange of any 
pair of particles. The division of the four doublet states into two pairs 
is arbitrary, and is done hqre in such a way as to make the first pair of 
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doublet states symmetric in the interchange of particles 2 and 3, and the 
second doublet pair antisymmetric in 2 and 3. As they are written, the 
doublets have no symmetry with respect to interchanges of the other pairs 
of particles. 


34. REARRANGEMENT COLLISIONS 

Cross sections for elastic and inelastic collisions of electrons with 
hydrogen atoms were calculated in Sec. 30 by means of perturbation 
theory, under the assumption that exchange of the incident and atomic 
electrons can be neglected. In this section, we consider the effects of 
electron exchange taken together with spin and the exclusion principle, 
but continue to use perturbation theory, which is most useful for high- 
energy collisions.^ We first consider a general rearrangement collision 
by means of the Born approximation of Sec. 26, then show the connection 
between this method and the time-dependent perturbation theory of 
Sec. 29, and finally apply the theory to exchange collisions of electrons 
with hydrogen and helium atoms. 

Notation for Rearrangement Collisions. A general binary rearrange- 
ment collision can be described as an event in which a system A in state 
m collides with a system B in state n, and systems C in state s and D in 
state t emerge. It is assumed that the same particles make up the sy^s- 
tems A,B as make up the systems C,D (no particles appear or disappear 
and no photons are involved), although the particles are rearranged dur- 
ing the collision. We use the letters AjB,CjD to denote all the internal 
coordinates (including spins) of the respective systems, and r^rf to 
denote the vectora that connect the centers of mass of the systems 

A,B and C,D, respectively, and M^b = and 

to denote the reduced masses associated with the relative motion before 
and after the collision. The entire calculation is peiformed in the center- 
of-mass system; the transformation to the laboratory system can be 
effected by means of the general results of Sec. 18. 

It was shown in Sec. 32 that a calculation of this type can be carried 
through as though the particles are distinguishable. At the end, a linear 
combination of the exchange-degenerate wave functions is formed that 
has the proper symmetry in each group of identical particles. The sym- 
metrization will be left for the specific examples given near the end of this 
section. For the general problem considered here, we obtain only an 
approximate unsymmetrized wave function. 

^ For a discusaioii of other methods applicable to lower energy collisions, see N. F. 
Mott and H. S. W. Massey, "'The Theory of Atomic Collisions*' (Oxford, New York, 
1933). 
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We wish to solve the wave equation 

{H -E)yp = 0 (34.1) 

where the Hamiltonian can be written in either of two ways: 

H = H^ + Hi, = Hci + HU (34.2) 

The unperturbed Hamiltonians for the initial and final systems are 


H^ = Ha + Hi + To., 


= H. + Hi + Toi, 



— — V*k 

2il/cd 


(34.3) 


where the repreaent the kinetic energy operatora for the relative 
motions in the center-of-masa system. The unperturbed states of the 
initial and final systems are (known) solutions of the wave equations 

{Ha “ Eam)Uam{A) = 0, {Hi “ ^ft„)i^6n(5) = 0 . .. 

(ffc - E,,)Uo.{C) = 0, {H, - Eadu,t{D) = 0 

The interaction terms and H^^ are regarded as small perturbations. 

It is always possible to expand the exact solution in the complete 
orthonormal set of functions Vca{C)udt{T>)^ where the expansion coef- 
ficients are functions of the relative coordinate Tod* 


yj/ — ^ Uoa{O^Udt{E)7>gi{Tc(l) 


(34.5) 


Our problem consists in finding approximate expressions for the functions 
VBt{^od) that correspond to internal final stal-es a and t for the systems C 
and Dj and that arise from tlie unperturbed initial state 


^0 = Uavi{A)Vin{B) exp (*0 ■ Tal,) 

2M ah 






{E - E,an - Ein) 


(34.6) 


Use of the Bom Approximation. Substitution of y[/ from Eq. (34.6) 
into th(^ wave etiuation (31.1) yi(^lds, with th(^ help of (3*1.2), (3*1.3), and 
(34.4) 

^ iir.(C)i<i,{D){'J\,t + AV, + Jiii - K)v,Arri) = -IIU^ (34.7) 
8,1 

If now Eq. (34.7) is multiplicMl through on the l(4t by Ura'{C)Udi'{D) and 
integrated over all t.he coor(linat-(^s of f/ and O, t-he orthonormality of the 
w’s causes all the terms on th(^ left side t-o vanish except that for which 
s = .s' and t = //. We drop t-he primes and write this as 

{Tod + Er.s + Eat - E)vM.d) = - i (34.8) 
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Equation (34.8) can be written in a form that is analogous to Eq. (26.4) : 

(-V|, - - ^ I j U„{C)UM{D)H',iHr^Ta 


= ^{B- - E^) 


(34.9) 


Equation (34.9) with all s and t represents a sequence of exact equa- 
tions that can in principle be solved for the functions v,t. Thin situation 
is similar to that encountered in Eq. (26.4), where we obtained jui approxi- 
mate solution by replacing the exact by the unperturbed solution on the 
right side. We now obtain an approximate solution of (34.9) by replac- 
ing by the 4>a given in (34.6); then the right side is known, and the 
inhomogeneous equation is readily solved for by means of an appropri- 
ate Green’s function. The substitution of for ^ is eciuivalent to the 
assumption that there is very little interaction between the unperturbed 
initial systems A and B. This implies not only that the transition 
■^>E C,D has a small probability, but also that is a good approxi- 
mation to the actual wave function even when the two systems A untl B 
are close together or overlap. In practical cases, it is difficult to set up a 
workable criterion for the validity of this approximation, although useful 
results are l^ely to be obtained when E is large in compaiison with all 
the interaction energies that appear in H'^. 

With the help of the Green’s function (26.15), the solution of the 
inhomogeneous equation (34.9), with 4^ replaced by becomes 

».((0 = - ^ y y y 14 _ [exp (i/f|4 _ r„„|)] 

• u»iC)ilit{D)H'^UamiA)ui„(B)[eKp (ifco • Tab)]d.T^radTc,t (34.10) 

The integration in (34.10) is overall the unprimed coordinate.s ; the 
dement of mtegration can be represented either as dTjT,dTod or as 
aTaClTbdrab, and is abbreviated in what follows as dr. 

The asymptotic form of (34.10) when systems C and D are well 
separated is 




^cd 


r'ed~^ oo 

y “M(G)ii*(D)[exp (— ik ■ r^j)] 


(34.11) 


• E^Mam(A)Mj„(B)[exp (iko • r„6)]dT 

h r thw “"Sles of the vector t^, and k is a vector that 

r34 6t? magnitude given by Eq. (34.9). Equation 

( . ) IS normalized so that the incident flux of systems A and B is the 
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init.ia.1 relative speed co = hko/Mah, and Eqs. (34.6) and (34.11) are 
normalized so that the radial outgoing flux of systems C and D per unit 
solid angle is t)|ff,e(5, <#>)!*, where the final relative speed v = hk/M^. 
Thus the differential cross section for the collision A,B — > C,D is 

«r..(9» = ^ \gM)\^ (34.12) 

Vo 

Lack of Orthogonality of Initial and Final States. There is an arbitrar- 
iness in the expression (34.12) for the cross section that arises from the 
fact that the wave function for the initial state is not in general ortho- 
gonal to the function = UoB{C)UdtiD) exp (^■k • Ted), the complex con- 
jugate of which also appears in the expression for may be 

said to describe a final state in which the systems C and D are observed 
to be moving in the direction 0,0. Since the initial and final wave func- 
tions are eigenfunctions of different unperturbed Hamiltonians Hah and 
Hcdi respectively, they are not expected to be orthogonal to each other. 
If they are not orthogonal, the addition of a constant potential energy 
(which corresponds to zero force) to H^d alters the expression for 
such a change in could be made by adding an arbitrary constant mul- 
tiple of 0 to both sides of Eq. (34.7), thus also changing the magnitude of 
E, We avoid this arbitrariness by defining H'd the energy of inter- 
action between systems C and D that vanishes as Ved becomes infinite; 
the additive constant is then fixed uniquely. A similar definition is 
made for 

It is interesting to note that Hab 1^^ iised in place of in the 
integral for g^t (^,0). This integral is with the help of Eq. 

(22.10) it can be transformed as follows: 

= ![{H — //crf)0/]0or/r = J0//70()rfr — J57j0/0orfr 

where wc have made use of the relation Hod^f = similar fashion, 

can l)c shown to bo ecpial to the last expression by making use 
of the relation //«60o = We see then that 

= JhIfUMr (34.13) 

Equation (34.13) has, for example, the consequence that (7^/(0, 0) = 0 
if H'ab ” even if H^d is^ ^^et zero and the initial and final states are not 
orthogonal. This n^sulli is to he expoct(Ml, since^ if //^b = 0, there is no 
interaction between the colli<ting systems A and B and the transition 
does not occur. 

Coimection with Time-dependent Perturbation Theory. It is also 
possible to derive Etp (34.12) by the method of variation of constants 
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(Sec 29) We expand the wave function ^ in unperturbed final-state 
wave functions ^^here the subscript/ 

stands for the states s and < of systems C and D and for the relative 
propagation vector k : 

^ * (34.14) 


The time-dependent wave equation is 




( 34 . 16 ^ 


Substitution of (34.14) into (34.15) gives 
ih ) d/^/e * = 2, 


%]3/t 

h 


(34.16) 


where use is made of the relation (Hod ~~ = 0. Equation (34.16) 

can be simplified by multiplying through on the left by and int egrating 
over all the coordinates; since the ypf are orthonormal, wo obt ain 


i(Bf' — ■ EAl 

‘ c ^ (34.17) 

/ 

The system of Eqs. (34.17) is exact. We now make two approxima- 
tions, which together are equivalent to the Born approximation sul)- 
stitution of for \p on the right side of Eq. (34.9). Fii^st, w'o iissunio t hat 
the perturbation H'd is small; because of Eq. (34.13), this is o(pn\'tUont 
for our purpose to the Born approximation assumption (bat If'i is small. 
Then we can insert the unperturbed amplitudes a^f on tlu^ rigid, sidc^ of 

(34.17) , and calculate the first-order perturbed ainpliludos on ih(^ 

__ij^ 

left side. Second, we assume that the initial state rl/oc ^ can bo oxpam lt‘(l 
in terms of only those (degenerate) ypf whose energies Ef are Oviual to the 
initial energy E. This assumes that is an eigcnfuiK^.ion of t lu' final 
unperturbed Hamiltonian Hod, which is ecpial to - IJU\ 

since i/'o is actually an eigenfunction of Hah, this also is otpiivahnd. to the 
assumption that the perturbations and are small. 

We can then replace -B/by E in the time factor on the right, side' of Jm|. 

(34.17) and take this factor outside of the summation over/. 44io iin jxm*- 

turbed amplitudes aj are defined by ^ afh, which giA'e,s 

< = IModr 
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With the help of (34.18), the summation over /can be rewritten 

, ^ j h'Hod^fdr J = j 

f 

where use has been made of the closure relation for the complete ortho- 
normal set of functions Thus Eq. (34.17) becomes, to first order, 

ihd^}) = • e * (34.19) 

Equation (34,19) can be handled in precisely the same way as Eq. 
(29.7), and yields the differential collision cross section (34.12). 

Exchange Collisions of Electrons with Hydrogen. As a simple first 
example of a rearrangement collision in which effects of identity and spin 
appear, we consider the elastic scattering of an electron from a hydrogen 
atom. In a problem of this type, we must know the asymptotic forms of 
the unsymmetrized wave function for all permutations of identical par- 
ticles.^ A wave function that has the proper symmetiy character can 
then be constructed by the methods outlined in Sec. 32. We first obtain 
the asymptotic form of the wave function when the incident electron is 
scattered and when the incident electron exchanges with the atomic elec- 
tron, to the accuracy of the Born approximation; spin-dependent inter- 
actions are neglected. 

The incident and atomic electrons are denoted by 1 and 2, respec- 
tively, in the unsymmetrized wave function. The asymptotic form of 
the stationaiy wave function i^(ri,ro) that corresponds to nonexchange 
elastic scattering with the total energy E is a product of the ground- 
state hydrogen wave funci-ion i^ioo(r 2 ) for electron 2, and an inc.ident- 
plane plus outgoing-scattered wave for elecitron 1: 

^(rijia) > [exp (-dc,) ■ Ti) + rrh;'^'«^/( 0 ,)]/Xio„(r 2 ) 

_ ,, , mr* (34.20) 

2m ~ 2ft2 

The work of Rees. 20 and 30 shows that tlu^ scuii tered amplitude has the 
form 

• (exp iko • ri)ai„n(r 2 )fiTidr 2 (34.21) 

where k is a vector of magnitude /(•« that has the direction di (/ does not 
depend on the other polar angle <#)i). 

^ J. R. Opponheimor, Pliytt. Rev., 32, 361 (1928). 
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The asymptotic form of ^(ri,rsi) that corresponds to elastic exchange 
scattering is a product of the ground-state hydrogen wave function «ioo(ri) 
for electron 1, and an outgoing scattered wave for electron 2. 

^(ri,rs) > r5-‘e***'>^(02)wioo(ri) (34.22) 

rj— > 00 

There is no plane wave in this case, since electron 2 is not incident on the 
atom. In accordance with Eq. (34.11), the exchange scattered amplitude 
is 

~ ^ • r^)]«ioo(ra) 

• (exp dko • ri)wioo(r 2 )dTidr 2 (34.23) 

where k has the magnitude i*o and is in the direction 6^, 

We must now form an antisymmetric wave function from products of 
^(rijra) and appropriate spin functions. The spin functions can be taken 
to be the set of four given after Eq. (33.6); however, it is simpler to make 
use of the four symmetrized combinations (33.6). The spin of the inci- 
dent electron is not assumed to have any definite relation to the spin of 
the atomic electron. In this case we can use either of these sets of spin 
functions, calculate the scattering with each of the four spin states of a 
set, and then average the results with equal weights for each state. ^ The 
first three of the spin functions (33.6) are symmetric, and must be multi- 
plied by the antisymmetric space function ^(ri,r 2 ) — ^(r 2 ,ri); the fo\irth 
spin function is antisymmetric, and must be multiplied by 


^(rijra) + !/'(r3,ri). 


Differential Cross Section. The asymptotic forms of the symme- 
trized space functions for large values of one of the electron coordinates, 
say Ti, are obtained from (34.20) and (34.22). 


^ This is a consequence of the fundamental hypothesis of quantum statistical 
mechanics; see, for example, R. C. Tolman, “The Principles of Statistical Mechanics,'* 
Sec. 84 (Oxford, New York, 1938). It can be shown that either of two complete 
orthonormal sets of wave functions can be used in such a statistical calculation (in the 
present problem, the two sets are complete so far as the spins of two electrons are 
concerned). The two seta, say Vn and Ub, were shown in Sec. 22 to be connected by a 


unitary transformation: Vn = ^SknUh, where is a unitary matrix. Then 

k 

y |y«|® == y SjmBla^nUkilk' * ^ hhnibilk' =* ^ 
n nXk' ki' T 


Since the probability of observing a given event (such as the scattering in a particular 
direction) is proportional to the square of a wave function, the same average result is 
obtained from a statistical mixture of either set of wave functions. 
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tA(ri,ra) ± 

>• [exp (?ko • ii) + ± r~‘e*'"^^(0i)]wioo(r2) (34.24) 

n— ► *0 

The first two terms in the bracket on the right side of (34.24) come from 
the first term on the left side, and the third term on the right side comes 
from the second term on the left side. The differential cross section must 
be computed with the upper sign in one quarter of the collisions, and with 
the lower sign in three quarters of the cases. We thus obtain 

<e) = i\m + + iim - r/(0)r- (34.25) 

Equation (34.25) can also be derived without explicit reference to 
the spin wave functions, as was Eq. (33.2), by making use of the earlier 
observation that particles that have different spin components are 
distinguishable. In half the collisions, the electrons have different spin 
components, and the cross section is just the sum \f{6)\^ + |f7(0)|^ of the 
direct and exchange cross sections; in the other half, the electrons are 
indistinguishable, and the antisymmetric space function must be used. 
We thus obtain 

crid) = + \g{e)[^) + i\f{e) - g{d)\^ 

which is easily seen to be the same as (34,25). 

An integral of the form /(exp ik ■ i)F{x)dT is small if ka » 1, where it 
is ossumefl that F is a smooth function of r that becomes small for r > a. 
Since both the ii anti I 2 integrals in (34.23) are of this type, we expect 
g to be quite small in comparison with /for /rao » 1. This is the situation 
in which the Born approximation is most applicable, so that the correc- 
tions due to exchange are expected to be fairly small for the cross sections 
calculated in StKi. 30. 

Exchange Collisions with Helium. In dealing with the ehistic scatter- 
ing of an electron from a helium atom in its ground state, it is convenient 
to work with the space and spin wave functions together. According 
to the discussion of Sec. 33, the two clc(d.rons in the helium atom are in a 
symmetric spac^^ state and an antisymmetric (singlet) spin state. Thus 
if the incident electron is denoted by 1 and the atomic electrons by 2 and 
3, the unperturbed wave function is (exp ^ko * 11 )^ 0 ( 12 , r 3 )y(l, 2, 3), where 
no is the symmetric space function for the normal state, and v(l,2,3) 
is a spin fuiujtion that is antisymmetric in 2 and 3. The eight spin 
functions for three elecd.rons arc grouped in (33,9) acc.ording to symmetry 
in 2 and 3; it is apparent that t;(l,2,3) must be one of the last doublet pair 
given there. 

The asymptotic forms of the first-order perturbed wave function 
including spin are found to be 
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V'(l,2,3) > [(exp tko • ri) + 7TV(0i)]Mo(r2,rj)j;(l,2,3) 

n— ► « 

— ^ r-i-y (02)uo(r8,ri)«(2,3,l) (34.26) 

fs-^ 00 

>-ri-y(0t)wo(ri,r2)»(3,l,2) 

f w 

where there is antisymmetry only in electrons 2 and 3. Here 
° “^*/// [exp (-ik-ri)]fio(r2,r3) 

. ( £ i ) (exp ^ko * Ti)Uo(j 2 ft 3 )dTldT 2 dTi 

V 12 ^18 ^1 / 

and a spin term t;*(l,2,3)?;(lj2,3) = 1 has been omitted. Also 
g'ie^) = ^(^2>*(2,3,1>(1,2,3), 

“ ~ & j f j (34.27) 

( ga q 2 2e^\ 

1 J (expdro * ri)uo(r2,r3)rfric?r2rfr3 

7'i2 ^28 7*2 / 

with a similar expression for g'iOz). The product of spin functions in 
(34.27) is readily evaluated by making use of the one-electron functions 
(33.4) and remembering that is the Hermitian adjoint of v. We take 
for d( 1,2,3) the next to the last spin function of (33.9), and obtain 

!;*(2,3,1)i;(1,2,3) = 2“*[(~ ++)* - (+ + -)*] 

-2-i[(+ + -) -(+-+)]= - i (34.28) 

The completely antisymmetric wave function derived from i/^(l,2,3) 
is given in Eq. (32.3) with the lower sign. Since yp is already antisym- 
metric in its last two arguments, it is apparent that the second bracket 
terms in (32.3) duplicate the first bracket terms. The iusymptotic form 
of the wave function for large values of one of the electron coordinates, 
say ri, is then obtained from (34.26) and (34.28); 

^(1,2,3) + ^(2,3,1) + ^(3,1,2) 

{(exp iko • rO + ~ i^(0i)]l ao(r2,rH)Kl,2,3) (34.29) 

The differential cross section obtained from (34.29) is 

<r(^) = |/(^) - g{e)\^ (34.30) 

Like Eqs. (33.2) and (34.25), Eq. (34.30) can be derived without 
explicit reference to the spin functions. Since the two atomic electrons 
must have antiparallel spins (singlet state) in order for the helium atom 
to be in its ground state, the spin component of the incident electron 
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is the same as that of one of the atomic electrons and different from 
that of the other. It cannot exchange with the latter in an elastic 
collision, since then both the resulting atomic electrons would be in the 
same spin state and the exclusion principle would force the atom into an 
excited state. Thus it can only exchange with the electron with which 
it is indistinguishable, so that the antis 3 nnmetric combination of direct 
(/) and exchange (g) amplitudes must be used; this gives Eq. (34.30). 

In the absence of spin-dependent interactions, the excitation of a 
triplet state of helium by electron impact can be accomplished only by 
exchange between the incident electron and one of the atomic electrons. 
In this case, there is no direct (/) amplitude, and hence no interl’erence 
between direct and exchange amplitudes. 

Problems 

1. Show that the antisymmetric wave fimction given in Eq. (32.7) vanishes if 

there is an identical linear relation between the functions Va, , Vv. 

2. Show that if a wave function w(l,2, ... n) is an energy eigenfunction of a 
symmetric Hamiltonian that corresponds to a nondegcnorate eigenvalue, it is either 
symmetric or antisymmetric. Show this first for n =• 2, then for n « 3, and then 
indicate how the proof c,an bo extended to arbitrary n. 

3. Verify that the spin wave functions given in Eep (33,6) are eigenfunctions of 
(Si + Sa)® and *S^i, -|- with the indicated eigcnivaliioH, Show also that the result of 
operating on these functions with the .r and // conipejicnts of the total spin is in agree- 
ment with the appropriate matrices given in E(is. (24.15). 

4 . Carry througli the (uilcuUitions of Prob. 3 for the spin functions given in Eq. 
(33.9). 

6. Find the (ugciif unctions of i,hc square of tlui magnitude of the total spin and 
tlie z compoiKuit of the total of four elecdroim, and show that they can be grouped 
into one quiiiU't, threes triplet, and l,wo singled slraU's. (Ilinl: Start with the triplet 
and singled. Hi)in functions for twe> i)airs e)f two elced-rons, and make use of the inatrie^es 
(24.15) te)gctheu’ with the (H)rreiHpe)neling sed fe)r j =2.) 

6. Use Eep (33.2) U> write down an expression for the serattering of pre^tons in thei 
cemter-of-nuiss e^()e)relinat(^ systeun, fissuiuing that the^ (loule)i!d) interactiem e^xtends in 
to r =» 0. Discuss the^ classie^al limit e)f (,hc e.ross seu'tion (h ->0), partieuihirly in I, he 
iienghben-hexxl of 0 = 90°, and show that t he^ inten’fe're'iie'e te^in elropsout if the average 
sewittering over an arbitrarily small but Unite range of angle is (‘e)rn[)uteMl. 

7. Show that the \s2p (!()nfigiinitie)nH in helium can be treMitexl separately fre)m 
the 1«2« e'.e)nfigiiratie)ns se) far as the> (irst-eirile^r emergy-Ievel e^ale*. illation of Sex?. 33 is 
coiieK'.rneMl. 

8. What woulel he the uiqiertnrbeel groiinel-state wave rune-tieins of helium if each 
edcctron hail spin :ingular moinentuin h and obeyi'cl Minslein-Hose, statistics? 

9. Write down the unp(‘rtnrheel ground-staU^ wave runed.iou for a neutral lithium 
atom. 

10. Show l)y direet e^ale-ulation that Eep (31.25) is obtainoel if the incident and 
at/omic eleidrons arc assumi'd to he <l(\se^rihed by Lhe four spin wave funedions (-|-4-), 

(H — ), ( — h), aiiel ( ), rather than by the tripled and singlet combinations (see^ 

footnote 1, page^ 236). 



CHAPTER X 

SEMICLASSICAL TREATMENT OF RADIATION 


No account has thus far been given in this book of the interaction 
between material particles and electromagnetic radiation. As would be 
expected, a treatment that is consistent with the foregoing theoiy of 
material particles requires that quantum equations of motion of the 
electromagnetic field be foimd that are analogous to MaxwelFs equations. 
Indeed, it is only in this way that Planck^s original quantum hypothesis 
can be fitted into a general theoretical framework. The development of 
the el emen ts of a quantum theory of radiation will be postponed until 
Chap. XIV. In the present chapter we treat the electromagnetic field 
classically and the particles with which the field interacts by quantum 
mechanics. Such a semiclassical treatment is bound to be incomplete 
and not wholly satisfactory, although it is simpler in principle than the 
quantum electrod 3 mamics presented in Chap. XIV. We shall find that 
it is possible in this approximate way to give a plausible and correct 
account of the influence of an external radiation field on a system of 
particles (absorption and induced emission), but not of the influence of 
the particles on the field (spontaneous emission). Nevertheless, the 
results of the classical treatment of the latter phenomenon can be con- 
verted to quantum theory in a correct, if not very convincing, manner. 
Some simple applications of the theory are given in Sec. 37. 


36. ABSORPTION AND INDUCED EMISSION 
The Schr5dinger wave equation for the motion of a particle of mass m 
and charge e in an electromagnetic field described by the potentials 
A,<^, with an additional potential energy V, is obtained by adding a term 
V'p to the right side of Eq. (23.24). 


r 


2m ^ me 






We regard V as the potential energy that binds the particle (of electro- 
static origin if the particle is an electron); A,<^) represent an electromag- 
netic field that is weak enough so that those terms can be regarded as a 
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perturbation. Our object is to calculate the probabilities of transitions 
between stationary states of the particle in the potential energy 7, that 
are produced by the field. We first discuss some properties of the field 
and its plane wave solutions. 

Maxwell’s Equations. Maxwell's equations of motion for the elec- 
tromagnetic field are, in Gaussian units, 


, r , 1 f. 


c dt C 


div E = 4tp, div H = 0 


(36.2) 


If the divergence of the second of these equations is combined with the 
time derivative of the third, we obtain the equation of continuity for the 
electric charge and current densities p and J 


divJ+i'.O 


(36.3) 


The electric and magnetic field strengths can be expressed in terms of the 
potentials by Eqs. (23.15). 

1 i)A 

E = - i ^ - grad H = curl A (36.4) 

C ot 


which cause the firat and fourth of Eqs. (35.2) to be satisfied identically. 
The potentials are not defined uniquely by Eqs. (35.4), since any Ajc#) that 
give the correct E and H can evidently be replaced by new potentials 
A',<A' without altering the field strengths, where 

A' = A + grad x, ^ ^ (36.6) 


and X w an arbitrary futud.ion of r and L 

Substitution of (35.4) into the second and third of Eqs. (36.2) gives 


, , . , 1 c)2A , 1 .d<t> 47r - 

curl curl A + + -grad - J 


i ^ div A + = -“4'7rp 

C at 


(36.6) 


If the vector A is written in rectangular coordinai.es, we can put 
curl curl A = grad (div A) — V-A, 

where the Ijist term is vector whose components are the Laplacians of 
the components of A. We can therefore simplify lOqs. (36.6) by making 
a gauge tramijormation (35.5) from A|</) to A',(^>' such that the new poten- 



242 


QUANTUM MECHANICS [Chap. X 


tiala satisfy the Lorentz condiMon: 

div A' + i ^ = 0 
c at 

The gauge function x then satisfies the equation 

i ^ = - fdiv A + 

^ dt^ \ ^ c dtj 

Equations (36.6) then become 


VW - 1 

c* df 

^ c* dt^ 


= -tJ 

= -'47rp 


(36.7) 


(35.8) 


(35.9) 


Plane Electromagnetic Waves. If J = 0 and p = 0 (completely 
empty space), it can be shown that it is possible to choose the gauge 
function so that div A' = 0 and = 0 for all r and without loss of 
generality (see Prob. 1). Then transverse plane wave solutions can be 
found for A', and hence also for E and H. We drop the primes and have 
in this case 

= divA = 0 (35.10) 


A typical plane wave solution of (35.10) is one that represents a real 
potential with the propagation vector k and the real polarization vector 
lAo|: 

A(r,^) = 2|Ao| cos (k • r — wi + a) 

= Ao[exp i(k • r — wi)] + c.c. (36.11) 


Here denotes the complex conjugate of the term that precedes it, 

and the constant complex vector Ao is defined to be |Ao|e'«. The first of 
Eqs. (35.10) is satisfied if w = kc, where k is the magnitude of k, and the 
second is satisfied if Ao is perpendicular to k. 

The electric and magnetic fields associated with the vector potential 
(35.11) are 

E = --2/c|Ao| sin (k • r - 01^ + a) 

H = —2k X lAo| sin (k • r — w/ + a) 

The Poynting vector (c/4t)E X H is evidently in the direction of k; its 
magnitude averaged over a period 27r/o) of the oscillation is 
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where |Ao|^ is equal to the scalar product of |Ao| with itself (|Ao| • [AoD or 
the scalar product of Ao and its complex conjugate (Ao * Ao). Equation 
(35.12) is the intensity associated with the plane wave (35.11). 

Use of Perturbation Theory. We now return to Eq. (36.1), and use 
it to calculate the probability of a transition between stationary states 
that is produced by the vector potential (35.11), which is regarded as a 
small perturbation. The third term (div A) and fifth tenn on the 
right side of (35.1) are now zero. The ratios of the second to the first 
term and the fourth to the second term on the right side of (36.1) are of 
order cA/cp, where p is the momentum of the particle. The magnitude 
of this quantity is estimated in a practical case in Prob. 2, and is so small 
that the perturbation approximation is justified. Thus to the first order 
of perturbation theory, we can neglect the term e^A‘^/2?nc®, and rewrite 
Eq, (35.1) 

ih ^ = (ff„ + H')i^ 

ieh 

^^0 = - ^ V* + 7(r), H' A- grad 

2m ' me ^ 


We proceed as in Sec. 29, and expand ^ in stationary eigenfunctions 
i4(r) of the unpert/urbed Hamiltonian //o with time-dependent coeffi- 
cients ak(f). If the system is initially in the state n and the perturbation 
is turned on at i = 0, the first-order amplitudes at time t are given by an 
expression similar to (29.17). 


HZ 


HZ - 1 771 '” - 1 


h Ui/no — OJ h Oikn + W 

= — / /7fr(exp • r)A() ■ grad Un<h 
me J 

ff'kn = ^ y '''Np C-?-k ■ r)]Aii • grad u„(h 


( 35 . 14 ) 


As disciLssod in Se(^ 2i), tli(^ probahiliiy of finding the system in the 
state h is appreciable only when the denominator of one or the other of 
the two terms in (35.11) is pra<^ti(‘-ally zero. Tliere is no inteHereiu^e 
between the two terms: the fit^st is important wluai Ek = En + Aoj, and 
the second is important when Ek = En — ho). Thus the probability of 
finding the system in a state Ic that has an energy higher than the initial 
state by about ho^ is proportional to \HZ\'^, and the probability of finding 
the system in a state /r' that has a correspondingly lower energy is pro- 
portional to \HkPn[\ 

Transition Probability. The discussion of Sec. 29 shows that the 
transition probability per unit time is independent of the time only if 
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the final state can be any of a very closely spaced or continuously dis- 
tributed group. The need for a group of final states arises from the 
dependence of the probability lajJKOI® on the energy, which is shown in 
Fig. 27; it is the area under this curve, not the ordinate at a particular 
abscissa, that is proportional to t 

In the same way, a constant transition probability per unit time is 
obtained in the present problem if the incident radiation is monochro- 
matic (definite value of co) and transitions can occur to any of a group of 
closely spaced or continuously distributed final states. The result is 
Eq. (29.12) with either or substituted for However, the 

computation of a transition probability between two discrete states is 
often of interest. In this case, the transition probability per unit time 
is not constant in time, if the incident radiation is strictly monochromatic, 
and depends markedly on the difference between w and 

IcOftnl = \Ek — En\/h, 

What we do in this case is to assume that the radiation covers a spread of 
frequencies with no phase relations between the different frequency 
components, so that the radiation can be characterized by an intensity 
per unit frequency range that is constant in the neighborhood of 

The probability of finding the system in the final state is then propor- 
tional to iHftnl* or which in turn is proportional to |Ao|^ and hence 

to the intensity. If the intensity in the small angular frequency range 
Ao) is J (a>) Acoj Eq. (35. 12) tells us that we can put 

|Ao|* (36.15) 

where Ao is the vector potential amplitude that characterizes the fre- 
quency range Ato. The probability that a transition in which the system 
is left in a higher energy state (S* ^En + fuo) has taken place at the 
time t is then 


_ V sin* iCw*. - u)t 

"4 - «)* 




T/' na I 

—5 — 5 i(a))Aa) 




Wjfc(exp {k • r) grad^^^„dr 


sinH(to*n cc)t ( 35 10 ) 

{Oihn — 0 ))^ 


^ For a discussion of the situation in which the intensity is not constant near |cjjbn|. 
see W. Heitler, “The Quantum Theory of Radiation,” 2d ed., Sec. 16 (Oxford, New 
York, 1044). 
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where grad^ is the component of the gradient operator along the polariza- 
tion vector Ao- The contributions to the probability from various fre- 
quency ranges are additive, since there are no phase relations between the 
radiation components of different frequencies. 

Each frequency range Aoj in (35.16) can be made infinitesimally small, 
and the summation replaced by an integration. Since the time factor 
has a sharp maximum at w = wfen, the other factors that involve co can 
be taken outside the integral and the limits on 6) extended to ± oo , as was 
done in going from Eq. (29.10) to (29.11). Thus the transition prob- 
ability per unit time for an upward transition becomes 


j f fi*(exp ik. • r) grad, 

J _ ca o\,mn 

4^2g2 I r 

= 2 / Ukiex-p ik • r) grad^ Undr 

^ \ J 


\ 

sin® ^{ukn — (is)t J 


(35.17) 


where the magnitude of k is now Ukn/c. An expression very similar to 
(35.17) is obtained for the probability per unit time of a downward 
transition {Ek' = E„ — hu ) : 


4ir®c® 


/ 


/(wnA-O / wjfc'[exp (— -ik • r)] grad^ Undrl 


(35.18) 


In this case the magnitude of k is ojnk'/c. 

Interpretation in Terms of Absorption and Emission. Equations 
(35.17) and (35.18) give probabilities per unit time for transitions of the 
particle between stationary states under the influence of a classical 
radiation field. These expressions can now be interpreted in terms of 
absorption and emission of cpianta of electromagnetic radiation. It is 
hccessaiy to assume that such (piaiita exist and provide the energy units 
of the radiation field, and that energy is conserved between field and 
particle. The particle gains the amount of energy Ek — E^ in an upward 
transition under the influence of radiation of angular frequency a)A„. The 
({uantum energy of this radiation is ho)kn = Ek — En, so that it is reason- 
able to associate with the upward transition of the particle the absorption 
of one quantum from the radiation field. 

In siinilar fashion the downwai'd transition is associated with the 
emission of one quantum whose energy corresponds to the frequency of 
the radiation field. In accordance with Eq. (35.18), the emission prob- 
ability is proportional to the intensity of the radiation present. This 
process is therefore referred to as induced emission. 
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It is sometimes convenient to rewrite Eq. (35.18) in terms of the 
reverse transition to that which appears in (35.17). Equation (35.17) 
describes the transition from an initial lower state w to a final upper state 
A*; (36.18) can be made to describe the transition from an initial upper 
state At to a final lower state n, if n is replaced by k and ¥ by n. Then 
(36.18) becomes 




/(w*n) 


|2 


Wn[exp (— tk ■ r)] gradx Ukdr\ 


(36.19) 


We can now show that the integral in (35.19) is just minus the complex 
conjugate of the integral in (35.17). By means of a pariial integration, 
or with the help of Eq. (22.10), the integral in (35.19) is seen to be equal 
to^ 

— Jiza grad a [un exp (— • i)]dT 


Since only the component of the gradient along the polarization vector Ao 
appears, and this direction is perpendicular to the propagation vector k, 
the operator grad^ does not affect exp (— • r). Thus the integral in 
(35.19) is equal to 


— JtiA;[exp ( — * • r)] grad^ Undr 

and the square of its magnitude is equal to the square of the magnitude 
of the integral that appears in (35.17). 

Since (35.17) and (35.19) are the same, the probabilities of reverse 
transitions between any pair of states under the influence of the same 
radiation field are equal. 

Electric -dipole Transitions. In most cases of practical interest, the 
wave length of the radiation is many times greater than the linear dimen- 
sions of the wave functions that describe the motion of the particle. This 
means that the quantity k • r that appears in the exponential in the 
integral of (35.17) is small in comparison with unity wherever u„ and 
Uk are large enough to give an appreciable contribution to the integral. 
A good approximation is then obtained by replacing exp • r by 1. 

The resulting integral can be simplified by expressing it as a matrix 
element of the momentum of the particle 


J Uk gradA Undr = | j ^kTAUndr = | (pA)jtn 

where Pa is the component of the particle momentum p along the direc- 
tion of polarization of the incident radiation. The matrix theory of Sec. 

^ In using Eq. (22.10), it mtist be remembered that the operator i grad, not tlie 
operator grad, is Hennitian. 
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23 shows that the momentum matrix of the unperturbed particle is given 
by p = m{dx/dt). Thus from Eq. (23.27) 

\ d i 

~ (p)/5n “ ^ (r)An — ^ (-®A> ^n)(x)kn “ i<^hn(j!^kn 

The integral in Eq. (35.17) becomes, in this approximation, 

/ Wl YH/ f 

Uk gradx Undr ^ - j <^ku(rA)kn = “ j / UkTAUndr (35.20) 

where is the component of r along the direction of polarization. Equa- 
tion (35.20) can, of course, also be derived without recourse to matrix 
methods (see Prob. 3). 

Transitions for which the probability can be computed by substitu- 
tion of (35.20) into (35.17) are called electric-dipole transitions, since only 
the matrix element of the electric-dipole moment er of the particle is 
involved.^ The transition probabilities per unit time for absorption and 
induced emission then become, in the dipole approximation, 

4-r2«2 

(35.21) 

It is convenient to denote by (r)*^ the vector whose cartesian components 
are the kn matrix elements of Xj and z, and to put 

l(r)*„|“ = (rHn-T?}!; (35.22) 

which is the scalar product of (r)jtn and ihs complex conjugate. The 
reason for doing this is that there are usually pairs of states k and n for 
which |(r)A:»|''^ is the same, but for which the vector {x)kn has various orien- 
tations in space.'-^ Then if 0 is the angle between (r)jtrt and the direction 
of polarization of the intndent radiation, |(r)/t„|'-* cos- 0 can be substituted 
for l(r/)i;.n|‘^ in ICq. (35.21), and an average performed over 0. The 
average of (35.21) for such paii-s of states is then 

» ( 35 . 23 ) 

Forbidden Transitions. It may happem that the dipole matrix ele- 
ment (r)fc„ is zero for particular states k and n. In that case the approxi- 

^ The quant, ily n Ih Iho (^Inririr-dipolo moiucuil of t.lin partiolo of charge a with 
roapect to an arbitrarily i)lac('d origin ; t he addition of a constant vector (corrcHponding 
to a shift in the origin) to r docs not aihud, the matrix element (35.20), eiiieo Uk and ?/„ 
are orthogonal. 

^ For example, if th(^ particle moves in a splK^rienlly synimetric potential F(r), tho 
state k can have I = 0, and the state-s n (tan have / = 1 and throe values (0, ±1) for 
the magmttie (inantiiin niimher ?a. 
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mate replacement of exp ik • r by 1 in the integral of (35.17) is not 
justified. The exponential can be expanded in a power series 

exp ik - r = 1 + ik • r + (^k • r)2 + ■ • * 

or in a series of spherical harmonics like (19.9) 

exp tk • r = jo(^r) + 3iii(/lT)Pi(cos B) — 6) +•• • 

where 6 is the angle between k and r. The second series is more con- 
venient than the first if, as is usually the case, the wave functions Uk 
and Un can be expressed in terms of spherical harmonics. 

With either series, the dominant factor in the nth term is proportional 
to (At)" if A-r <3C 1 [see the first of Eqs. (16.7)]. Thus if the dipole matrix 
element vanishes but the next term of each series does not, the transition 
matrix element is reduced by a factor that has the order of magnitude A’a, 
where the linear dimensions of the particle wave functions are of order a. 
A transition of this type is called a forbidden transition^ since its prob- 
ability is reduced by a factor {kaY with respect to dipole or allowed 
transitions j and usually A-a <K 1. Successive terms in the series can be 
interpreted in terms of electric-dipole, -quadrupole, etc., transitions, and 
involve successively higher powers of ka. 

If both the states Uh and Un are spherically symmetric, the integral 
/^^(exp ik ■ r) grad^ Undr is identically zero. This can be seen by choos- 
ing cartesian coordinates for performing the integration such that the x 
axis is along the direction of polarization. Then gradx Un is an odd 
function of x, whereas Uk is an even function of x, and exp ^■k • r = 
is also even in x, since the vector k is perpendicular to the direction of 
polarization and hence is in the yz plane. Thus the integrand is an odd 
function of x and the integral in (36.17) vanishes. The transition 
between these states is said to be strictly forbidden, since the first-order 
probability given by (36.17) is zero. It is still passible for transitions to 
be produced by higher orders of the perturbation IF given in Eq. (35.13) ; 
in such a calculation, the previously neglected term e^k^f2mc^ must be 
included in H', However, it can be shown with the help of quantum 
electrodynamics that such higher order transitions involve more than 
one quantum, and hence are not simple emission or absorption processes 
in which the quantum energy is equal to the energy difference between the 
unperturbed states k and n of the particle. 


36. SPONTANEOUS EMISSION 

A classical charged oscillator can absorb energy from a radiation field 
or give energy up to it, depending on the phase relation between the field 
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and the oscillator. These effects are analogous to the absorption and 
induced emission of the last section. A classical oscillator also emits 
radiation spontaneously, whether or not an external radiation field is 
present. In this section, we calculate the electromagnetic radiation 
from a classical oscillating charge-current distribution in the absence of 
external fields, and somewhat arbitrarily rewrite the formulas in terms of 
quantum matrix elements to obtain a probability for spontaneous mis- 
sion. The results are then verified by comparison with Planck^s expres- 
sion for the distribution of thermal radiation in a cavity. 

Classical Radiation Field. A charge-current distribution can be 
completely specified by the current density J, since J determines the 
charge density p through the equation of continuity (35.3). In similar 
fashion, the electromagnetic fields in empty space, away from charges 
and currents, can be completely specified by either E or H, because of the 
connections (35.2) between them. Now a wave equation for H is 
readily obtained by taking the curl of the first of Eqs. (35.9). 

(36.1) 

Thus H can be obtained in terms of J alone, whereas the similar equation 
for E involves both J and p (although p can, of course, be eliminated). 
We proceed by solving Eq. (36.1) for H when the three cartesian com- 
ponents of J vary harmonically in time with the same angular freciuency 
a but not necessarily with the same phase: 

J„(r,0 = 2|./,(r)| cos (coi - rj,) = -f c.c. 

J.{r) = l./.(r)le*- ^ 

with similar expressions for the y and 2 components. We are interested 
only in the steady-state solutions for E and H that have the same fre- 
quency (0 

E.(r,0 = 2|E.(r)| cos (tof - t.) = E*(r)c-^‘ -f c.c. 

H»(r,<) = 2|H;.(r)| cos {wt - f,) ■= Hx(r)n-'“‘ + c.c. (36.3) 
Ex(r) = lEx(r)lc*«', Hx(r) = |Hx(r)lc«* 

again with similar expressions for the y and z components. From the 
second of E(is. (35.2), E is given in terms of H in empty space l.)y 

E(r) = curl H(r) (36.4) 

0) 

With subRtitutions from (36.2) and (36.3), Eq. (36.1) becomea 

(V^* + fr=)H(r) = - ~ curl J(r), /r = (36.5) 



260 


QUANTUM MECHANICS 


[Chap. X 


expressed m terms of the Green's function (26.15), and is 
UM ^ ^ f curl T(r0 ^ 

" ^ C J ~\t- r'l ~ (36.6) 

0^ (36.6); for large r, this solution 
+V, wu ^ and i like r-»e«*’-“0 + n c so 

*’^ * ttaVkud 

ence is retarded with respect to the current element. 

interested in the energy and angular 

STomH-r”;? ^ below, the energy can 

field at verv In ^ terms in the asymptotic expression for the 

^ adSr^i?;/' ^ *be angular momentum requires 

i“4rtd of ?rfi f r-dependent part of the 

ntegrand of (36.6) can be expanded in powers of 1/r 

ei g (fA-|r - r'|) ^ i / r' cos + Ukr'^ sin^ e\ , 

|r - r'l r V d r ) (36.7) 

where d is the angle between r' and r. Substitution of (36 7) into (36 6') 
ponent ^ ^ *be oscillation has as a typical com- 


P.(r) - ^{|E,(r)| |H„(r)|[cos (o,t - f,) cos (o,t ~ 

^ ~ |Ev(r)| |Hx(r)|[cOS (o)t - f„) cos (ut — fx)]tbn.av..} 

= ^ {|Ex(r)| |H,(r)| COS (f. - f,) _ |E,(r)| |H4r)| cos (f„ - f,)} 

This and the other two components can be put in the form 


P(r) = ^ Re[Eir) X FRF)] ( 36 . 8 ) 

wtTtSstS°oSv ?n tho tke expression that follows. Now we 
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from the center of the charge-current distribution to the point at which 
the field is measured. Equations (36.4), (36.6), and (36.7) then give, 
to order 1/r (where now r = z), 


^ j Jy{r')e-^-'dT' 



1 .7*(r')e-'*''dT' 

H.-^o 



r 

./*(r')e-“''dT' 


r 

E.-^0 



(36.9) 


Partial integi'ation has been used to get rid of the derivatives of J in 
the integrand of H. Equations (36.9) show that the asymptotic fields 
are transverse to the direction of propagation. They also relate the 
polai'ization of the emitted radiation to the current distribution, and 
show that only the component of the current perpendicular to the direc- 
tion of propagation contributes to the radiated energy. Substitution 
into (36.8) gives 

P. = 2^^ (1/ + 1/ (36-10) 

Equation (36.10) can be generalized to give the average energy flux in 
the direction of a vector k 


2nr‘^c 


/ 


•/xk(r')[(^xp (-^k • r')]^/r' 


(36.11) 


where Jxk is the component of J perpendicular to k. 

Dipole Radiation. Equation (36.11) is an exact expression for the 
energy radiated by the classical current distribution (36.2), As in Sec. 
35, the ele(^tric-dipolc approximation is obtained in the long-wave-lenglh 
limit by assuming that W <$C 1 and replacing exp (— ik • r') by 1 in the 
integrand. The energy flux is then 


%rr^c 




(36.12) 


Fi'om Eqs. (36.9) with the same approximation, it is apparent that the 
polarization (direction of the electric field) of the radiation is determined 
by the total current vector Jo = JJ(r')^fT'. The radiation is linearly 



262 


QUANTUM MECHANICS 


[Chap. X 


polarized if Jo has only one component in the plane perpendicular to the 
direction of propagation, circularly polarized if Jo has two equal com- 
ponents in this plane that are perpendicular to each other and 90® out of 
phase (so that one component is i times the other), etc. 

The angular distribution of the radiation can be found by replacing 
(36,12) by (Jo - Jo) sin^ B = |Jo|^ sin^ 6, where ^ is the 
angle between Jo and k, and |Jop is an abbreviation for the scalar product 
of Jo and its complex conjugate. The total power radiated is then the 
integral of (36.12) over the surface of a sphere of radius r 

^ iJol* (36.13) 

Angular Momentum. The angular momentum radiated per unit 
time is equal to the torque exerted on a large, perfectly absorbing sphere 
that is centered at the charge-current distribution that constitutes the 
source of radiation. The average energy flux is P, so that the (directed) 
energy density is (l/c)P and the momentum density is (l/c^)P. Since 
the radiation travels outward with speed c, the torque exerted on a 
perfectly absorbing differential element of area, dA^ that is perpendicular 
to r is cdA times the vector product of r and the momentum density: 
{dA/c)(t X P). Integration of this quantity over the sphere of radius 
r gives the angular momentum radiated by the source per unit time. 
Thus only the components of P tangential to the sphere are involved: in 
the notation of Eqs. (36.9), these are P® and since the z axis is along r. 

If Eg and were zero, the tangential components P* and Py would 
also be zero, and no angular momentum would be radiated. The third 
and sixth of Eqs. (36,9) imply only that the z (radial) components of the 
field are of smaller order than 1/r; actually they are of order l/r^. This 
means that P» and Py fall off as l/r* for large r. Then since r X P appears 
in the expression for the angular momentum, and the area of the absorb- 
ing sphere is proportional to r®, the total angular momentum absorbed by 
a large sphere is independent of r. 

We require the l/r^ terms in E* and H«, but not in the other field 
components. 

f [y'J.ir') - x'Jy(.t')]e-^*^dT' 

K (36.14) 

Equations (36.9) and (36,14) are sufficient for an exact calculation of the 
radiated angular momentum. 
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Dipole Case. The expressions for P® and Py are simplified by the 
dipole approximation since only the terms of lowest order in W need be 
retained in (36.9) and (36.14). It is easy to see that threading term in 
P«, for example, is — (c/2T)Pe(EaHy) and not (c/2ir)Pe(EvH«). We obtain 
to lowest order in W 


f f 


(36.16) 


Equations (36.15) refer to axes fixed with respect to the element dA 
of absorbing area at r. They must now be rewritten in terms of general 



Fig. 20. Holtttion between the unprimod ooordinnlo ayHtem of Eqs. (30.15) and the primed 
coordinate Hyatoin of Eq. (30.16). 

cartesian coordinates, in order that the angular-momentum component 
about a particular axis fixed in space can be found. This is analogous to 
the rewriting of the energy-flux expression (36.10) in the general form 
(36.11), but is somewhat more complicated. To accomplish this, we 
choose new cartesian coordinates x\y^jZ' that are fixed in space. With 
respect to these, the orientation of the old axes depends on r in the follow- 
ing way (see Fig. 29) r the z axis is in the direction of r and has the polar 
angles with respect to the new axes, the y axis is perpendicular to t 
and in the plane of r and 2 ;', and the x axis is perpendicular to the plane of 
r and 2 '. If now we wish to calculate the contribution to the component 
of angular momentum from absorption by the element of area 

dA = sin QdOd<t> 
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at r, we need only given in (36.16). This can be rewidtten in terms of 
the components of the total current vector Jo along the new axes as 

k 

Pa = — ^ J?e[i(J‘o»' sin d cos ct> + Joy' sin 0 sin <^ + Jo*' cos 6) 

irV C 

’ Oy' COS ^ J oa/ sin <^>)] (36.16) 

The moment arm about the axis associated with P^ is r sin dj so that the 
differential element of angular-momentum component is 

dMgf = - r sin ^ P^r^ sin dddd(t> (36.17) 

c 

Substitution of (36.16) into (36.17) and integration over the polar angles 
gives 

^ - Jov Jo.') (36.18) 

It is appai*ent from Eq. (36.18) that the radiation of a particular 
component of angular momentum depends only on the perpendicular 
components of Jo. Moreover, there must be two such perpendiculai^ 
components that are out of phase with each other, for if Jo*' and Joy^ are 
both real or have the same phase, the parenthesis in (36.18) is zero. Thus 
a linear dipole (Jo entirely in one direction) radiates no angular momen- 
tum. The maximum angular momentum for a given value of |Jo|® is 
radiated when Jo has two components that are 90° out of phase with each 
other and the third component is zero. If the nonvanishing components 
are along x' and y' we can put Joy' = iJox'; Eq. (36.18) then becomes 

M, = g = g |J„|^ (36.19) 

and the other two components of M are zero. 

Comparison of Eqs. (36.13) and (36.19) shows that the maximum 
angular momentum radiated per unit time by an oscillating electric 
dipole is 1/kc = l/co times the energy radiated per unit time. If this 
relation is taken over into quantum theory, it shows that a quantum of 
energy, radiated by an electric dipole carries with it an amount of 
angular momentum that does not exceed h. 

Conversion from Classical to Quantum Theory. We now convert 
the classical expression (36.13) for the power radiated by an electric 
dipole, to quantum theory. This requires that we find a quantum 
analogue for the total current vector Jo and associate the radiated power 
with a transition probability between states of the particle that is doing 
the radiating. 
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We want to replace J by a current density that is associated with an 
initial upper state Uk and a final lower state Un, since energy is radiated 
during the transition from fr to n. It is natural to represent the current 
density as the product of a charge density and a velocity, and to take for 
the velocity the momentum operator divided by the mass: — (ih/m) grad. 
The charge density for a stationary state is expected to be the charge of 
the particle times its position probability density: e|^|2. However, we 
are concerned here with a transition between states, so we replace this by 
eiinUh. The way in which the grad that appears in the velocity operates 
on the wave functions that appear in the charge density is determined by 
arguments of the type presented in Sec. 7 [see Eq. (7.3)]. We thus arrive 
at a quantity to substitute for the classical current density; 

J(r) ->• - ^“n(r) grad M*(r) (36.20) 


We assume that (36.20) can be substituted into all the foregoing classical 
expressions to give quantum results.^ 

Integration of (36.20) over the coordinates gives the total current 
vector 

Jo = - ^ ^ J UnTUudr = iCWkn(j)^ (36.21) 

with the help of equation (35.20). Substitution of (36.21) into (36.13) 
then gives the radiated power. We interpret this power as the product 
of the spontaneous rate of transition from k to n, and the quantum energy 
huikn = h{Ek — En) given olT in each transition. The transition prob- 
ability per unit time for spontaneous emission then becomes 


1/ V |2 _ i/ . u 


(36.22) 


where use has been made of the relation = kc. 

Planck Distribution Formula. TIk^ transition from the classical 
expression (36.13) to the quantum expression (36.22) can claim only a 
moderate amount of plausibility. The correctness of i,ho latter result can 
however be verified by showing that Planck’s foi-mula for the spectral 
distribution of thermal radiation in a cavity follows from (36.22) and 
(35.23). This is the way in which the relation between the probabilities 


^ Thd (^xpoiuuitiiil that appears in (HC.ll), for exanijile, can he placed either before 
or after the grad operator, nineo only the component, of grad that is perpendicular to 
k enters in, 
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for absorption, induced emission, and spontaneous emission was first 
obtained.^ 

We assume that the walls of the cavity contain particles of charge e 
and mass m, each of which is bound by a potential V of the type that 
appears in (36.1). When these particles are in equilibrium with thermal 
radiation at the absolute temperature T, there must be as many quanta 
of each frequency emitted as absorbed per unit time. The rate of 
emission of quanta of frequency wkn is the sum of (36.23) and (36.22), 
multiplied by the number of particles that are in the upper state k. The 
rate of absorption of these quanta is the product of (36.23) and the 
number of particles in the lower state n. N o w from statistical mechanics, * 
the equilibrium ratio of the number of particles in the upper state to the 
number in the lower state is given by ’vvrhere k is Boltzmann's 

constant. We thus obtain, on dropping the subscripts from ojatj, 

[m + S' I W-l’] - W 

This is readily solved for I («) to give 


J(co) = 


Act) 

^2^2 (g«r — 1) 


(36.23) 


It is interesting to note that the parameters m, and (r)ftn of the particle 
that emits and absorbs the radiation drop out of the expression for /(co). 
The agreement between Eq. (36.23) and the Planck distribution formula 
provides a verification of the ratio of (35.23) to (36.22), and hence shows 
that the latter expression is correct if the former is. 

Line Breadth, A classical oscillator that radiates electromagnetic 
waves loses energy, so that the amplitude of its oscillation decreases in 
time. Thus the electromagnetic fields given off by it have a damped 
sinusoidal time dependence: cos {o)ot + a). The Fourier analysis of 

these fields gives the frequency spectrum of the radiation from the oscil- 
lator. The radiated intensity per unit frequency range at the angular 
frequency w is proportional to 


1 

(co — coo)^ + ^7^ 


(36.24) 


^ A. Eins tein, Phys. Zeiis.f 18, 121 (1917). Emstein^s A coefficient is just (36.22), 
and his B coefficient is (35.23) divided by the energy density of radiation I (cohn) I o (this 
is expressed in terms of the angular frequency rather than the circular frequency 

* See, for example, R. C. Tolman, »The Principles of Statisticftl Mechanics,” Chap. 
IV (Oxford, New York, 1938). 
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According to (36.24), the intensity of the emitted spectral line has half 
its maximum value when co = coo ± iy. The quantity y is called the 
natural line breadth, and in cases of practical interest is small in com- 
parison with coo. 

The line breadth is evidently half the initial fractional rate of decrease 
of the amplitude of the classical oscillator, or is equal to the initial frac- 
tional rate of decrease of the oscillator’s energy. It is plausible to associ- 
ate the rate of decrease of the energy of the classical oscillator with the 
rate of decrease of the probability of finding the corresponding quantum 
system in its initial upper state. If this is done, the quantum analogue of 
the classical natural line breadth y is the initial transition probability per 
unit time for spontaneous emission given in (36.22).^ 

The foregoing relation between transition probability and line breadth 
can be arrived at in a qualitative but more general way by means of the 
uncertainty relation (3.3). The reciprocal of the transition probability 
per unit time is of the order of magnitude of the time that the quantum 
system stays in its upper state. Thus a determination of the energy of 
the upper state cannot occupy a time that is of larger order of magnitude 
than the lifetime I /7 of this state. According to (3.3), this means that 
the energy cannot be determined with an accuracy much greater than 
h divided by the lifetime, or hy. If the energy of the upper state is uncer- 
tain by this amount, the frequency of the emitted line will be uncertain 
(broadened) by 7 . In general, a quantum energy level is broadened by 
any process that shortens its lifetime: the level is perfectly sharp only 
if the lifetime of the state is infinite (true stationary energy eigenfunction). 

A qualitative idea of the natural breadth of electric-dipole lines 
emitted by a quantum system can be obtained by rewriting the expres- 
sion (36.22) for 7 in the form 


7 




The factor e^/hc is a dimensionless constant that is very nearly equal to 
tJt if ^ is the electronic charge/-^ and the facitor A’‘‘*l(r)jk 7 *l“ has already been 
assumed small in comparison with unity in arriving at the dipole approxi- 
mation. Thus the ratio of line breadth to angular frequency is expected 
to be quite small (it is of the order of 10 “° for typical atomic dipole lines). 


37. SOME APPLICATIONS OF RADIATION THEORY 
The semiclassical radiation theoiy developed earlier in this chapter is 
applied in this section to the determination of the conditions for allowed 

^ For further discussion of line breadth, see Hoitlcr, op. cit., Sec. 12. 

“ This is the fine-sir u dure constant that appears in the theory of tlie fine structure 
of atomic energy levels (see CMuip. XII). 
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transitioDB, and to the theory of the Cerenkov effect. The latter problem 
is of some practical interest, and also shows how one can calculate the 
radiation from a current distribution that does not depend harmonically 
on the time. 

Selection Rules for a Single Particle. The discussion of forbidden 
transitions at the end of Sec. 35 shows that the probabilities for absorption 
and induced emission are reduced by a factor of at least {ka)'^ with respect 
to allowed transitions if the dipole matrix element (r)hn vanishes. The 
same remarks apply to the probability for spontaneous emission, since the 
integral in (36.11) is the same as that in (35.19) when the substitution 
(36.20) is made for J. 

The conditions on Uk and Un for which the dipole matrix element is 
different from zero constitute the selection rules. They are easily formu- 
lated if the potential V that appears in the unperturbed Hamiltonian of 
(35.13) is spherically symmetric. It is shown in Sec. 14 that the energy 
eigenfunctions can then be written as products of functions of the radial 
distance r and spherical harmonics defined in Eq. (14.16). 

The matrix element (!)*« is the vector whose cartesian components are 
the corresponding matrix elements of x, and z. The matrix element of 
z is Jwftrcos BundTy which can be written as a product of an integral over 
r and the angular integral 


io fo ^ sin 0d0d(l> (37.1) 

where the primed and unprimed subscripts are the angular-momentum 
quantum numbers for the lower state Un and the upper state u*, respectively. 

The <t> integration in (37.1) is which is zero unless 

m = m. The integration can then be written, apart from numerical 
factors, 

wPf{w)Pf,{w)dw, w = cos 9 (37.2) 

Now it can be sbown with the help of the generating function (14.13) for 
the associated Legendre functions that 


wPf{w) = 


I + 


21 + 1 




2i + 1 


Substitution of this into (37 .2) shows, with the help of the orthogonality 
hitegral (14.15), that the matrix element of z vanishes unless tn' = jn 
and V = I ±1. A similar treatment shows that the matrix element of 


X + iy vanishes unless m' = m - 1 and 1' = 1 + 1, and the matrix ele- 
ment of X — iy vanishes unless m' = m + 1 and V = I ± 1. These 
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selection rules determine the possible allowed (electric-dipole) transitions 
of a single charged particle that moves in a central force field. 

Polarization of Emitted Radiation. The discussion following Eq. 
(36.12) shows that the polarization of the emitted radiation is determined 
by the total current vector Jo, and hence (because of 36.21) by the dipole 
matrix element. When the initial and final states have I values that differ 
by unity and the same magnetic quantum number m with respect to the 
z axis, only the matrix element of z fails to vanish. The radiation is then 
linearly polarized along the z axis if viewed in the xy plane, and there is 
no radiation along the z axis. When the magnetic quantum numbers of 
the initial and final states differ by unity, the x and y components of the 
dipole matrix element are 90° out of phase, and the z component vanishes ; 
the radiation is then circularly polarized if viewed along the z axis, and 
linearly polaiized perpendicular to the z axis if viewed in the xy plane. 
These results are of interest in connection with the polarization of the 
radiation from atoms placed in a magnetic field (see the discussion of the 
Zeeman effect in Sec. 39). 

Conservation of Angular Momentum. The discussion of Eqs. (36.18) 
and (36.19) shows that the angular momentum that is carried away by 
an emitted quantum has its maximum value A, and is directed along the z 
axis, when J^y = iJox- From (36.21) we see that this is the case in which 
iy)kn = i{x)knj or {y)kn = Now X = r Hin e COB (j) = ir sin 6 

• and y = r sin 6 Bin ^ sin Then it is 

apparent from the ^ integration in (37.1) that in order for the matrix 
element of y to equal —i times the matrix of the magnetic quantum 
number of the initial state Uj, must be greater than the magnetic quantum 
number of the final state Un by unity. Applic.ation of Eq. (14.23) shows 
that the z component of angular momentum of the particle has decreased 
by h during the transition. Thus angular momentum is conserved between 
the radiating particle and the emitted quantum. 

The foregoing result is based on the connection between the classical 
and quantum current densities assumed in (36.20). The successful 
derivation of the Planck disti'ibution law in Sec. 30 shows that (36.20) 
is correct so far as magnitude is concerned. The above demonstration 
of the conservation of angular momentum shows in addition that the 
phases of the initial and final states are inserted properly in (36.20) ; if, 
for example, J had been assumed propoiiional to Uk grad Un, an incon- 
sistent result would have been obtained in the last paragraph. 

If the magnetic (luantiim number docs not change in a transition, only 
the matrix element of z fails to vanish, and the discussion of (36.18) shows 
that the quantum carries off no angular momentum. This might seem 
at first to be in contradiction with the (change by one unit of the orbital- 
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angular-momentum quantum number L The x and y components of the 
angular momentum of the particle do not commute with the z component 
(which in this case is known to be mh and does not change), so that they 
cannot be precisely specified. Their expectation or average values for 
states that are described by quantum numbers I and m are zero, since the 
diagonal elements of the matrices for the x and y components of the angu- 
lar momentum are all zero [see Eqs. (24.16)]. Thus there is no observable 
change in any of the components of the particle's angular momentum, 
and so the expectation value of the angular momentum carried off by 
the quantum should be zero. The x and y angular-momentum com- 
ponents for a particle in a stationary state can be thought of as fluctuat- 
ing about zero in such a way that their average values are zero, although 
their average squares are not. The change in I corresponds to changes in 
these average squares. 

Selection Rules for Many-particle Systems. When a quantum- 
mechanical system consists of several particles that do not interact with 
each other, the total Hamiltonian is simply a sum of terms like Ho + J?' 
that appear in (35.13). The unperturbed energy eigenfunctions are 
products of single-particle eigenfunctions such as are discussed in Sec. 
32 (they can be unsymmetrized if the particles are not identical). It 
is clear that the matrix element that appears in the first-order perturba- 
tion theory of Sec. 36 (absorption and induced emission) involves a 
multiple integral of the form 

; • • • K'(i)tZ6<2) 

• • • IH'il) + H'(2) + ■ • • ]Ua{l)ut{2) • ^ • drtdr, • • • 

Because of the orthogonality of different u^b for the same particle, this 
integral vanishes unless all the single-particle functions Uaf, . . - are 
equal to the corresponding functions Wa, . . . , except for one. Thus 
only one of the particles can change its state in a transition, and the 
selection rules for a central force field are precisely those given above. 
Since the spontaneous transition probability can be related to the same 
integral through the Planck distribution formula, these selection rules 
hold for spontaneous emission as well as for absorption and induced 
emission. 

If the system consists of several charged particles whose mutual 
interactions cannot be neglected, we must base the selection rules on 
general conservation laws for total angular momentum and parity. It is 
not difficult to generalize the work of Secs. 35 and 36 to show that the 
donodnant term, when the wave length of the radiation is large in com- 
parison with, the dimensions of the system, is the matrix element of the 
total electric-dipole moment eiti + eir 2 + • • • . The angular momen- 
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turn radiated by an oscillating dipole cannot, according to the discussion 
of (36.19), exceed h per quantum; this classical argument is based on an 
arbitrary current distribution, so that it is not invalidated if several 
particles contribute to the dipole. The interpretation of the conservation 
of angular momentum between the emitted quantum and the radiat- 
ing system is complicated by the semiclassical nature of the entire treat- 
ment. A consistent treatment based on quantum electrodynamics shows 
that the correct selection rule based on this conservation law is that the 
total-angular-momentum quantum number of the system can remain 
unchanged, or increase or decrease by unity. An exceptional case is 
that in which this quantum number is zero for both initial and final states. 
The initial and final wave functions are then spherically symmetric, and 
an extension of the argument given at the end of Sec. 35 shows that a 
radiative transition between these states is not only forbidden (no allowed 
dipole transition), but strictly forbidden (no first-order transition 
whatever). 

The discussion of parity in connection with Eq. (23.26) shows that the 
parity of each energy eigenfunction can be well defined (even or odd) if 
the total Hamiltonian is unchanged by reflection of the coordinates of all 
particles through the origin. All known Hamiltonians are of this type. 
Now the electric-dipole-moment operator given above is evidently odd 
with respect to reflection of all coordinates ; thus its matrix elements van- 
ish unless the initial and final states have opposite parities. This selec- 
tion rule is known as the Laporie rule. 

Cerenkov Effect. It is not difficult to show that a charged particle 
that moves freely in empty space does not radiate energy. Electro- 
magnetic radiation has the greatest momentum for a given amount of 
energy when it is all flowing in one direction, in which case the ratio of 
momentum to energy is 1/c. The relativistic relation between the total 
energy E (including rest energy) and the magnitude p of the momentum 
of a freely moving particle is + m^c^, where m is the rest mass.^ 

Differentiation of this relation gives 2EdE = 2pcHpy so that the ratio of 
momentum loss to energy loss of the particle if it radiates is 

^ = i Ti 4 - Yl“ 

dE pc^ Cl \ ?^ / J 

which is always greater than 1/c. Thus it is impossible for a freely 
moving particle to radiate, since if energy were conserved between the 
particle and the electromagnetic field, momentum would not be. 

^ See, for example, P. G. Bergiiianii, “Introduction to the Theory of Ilolativity," 
Chap. VT (Prentico-Hall, New York, 1042). 
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The situation is altered if other matter is present to take up the excess 
momentum. An electron bound in an atom or an electron that passes 
near an atomic nucleus can radiate; the latter case is usually described in 
terms of the deflection of the electron by the nucleus and the emission of 
radiation by the accelerated electron. An interesting example in which 
the presence of matter makes it possible for the consei’vation laws to be 
satisfied is provided by the uniform motion of a charged particle through 
a medium of refractive index n, when the speed v of the particle is greater 
than the speed of light in the medium: 

„ > I (37.3) 

n 

The radiation from fast electrons that satisfy (37.3) has been observed 
by Cerenkov.^ The theory of the Cerenkov effect^ is developed on the 
basis of classical radiation theory in the remainder of this section. This 
treatment is satisfactory in so far as the structure of the medium can be 
represented by a refractive index. 

Expression for the Current Density. We assume that the moving 
charge has dimensions that are small in comparison with emitted wave 
lengths of interest. Then if the particle has charge c, moves in the z 
direction with the speed v, and is at the origin at ^ = 0, the current density 
can be written 

= ev8{x)d(y)8(z — vf) 

The Dirac 8 fxmction is that introduced in Sec. 11. Since the radiation 
theory developed in Sec. 36 is based on a current density that depends 
harmonically on the time, must be expressed in terms of a Fourier 
integral 

== p [JUT)e-^‘ + o.c.]d<, (37.5) 

The Fourier amplitude of angular frequency w is given by 

JUr) = £ S(x)d{y)eT (37.6) 

This OEUi be verified by substituting (37.6) into (37.5) to obtain 
± S{x)5iy) jj 

= ^ Kx)S{y) j 

1 P. A. Cerenkov, Phys. Rev.j 52, 378 (1937). 

> I. Frank and I. Tamm, Comptes Rendua de VAcad, Sd. UM.S.S., 14 , 109 (1937). 
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which with the help of (11.10) and the fifth of Eqs. (11.13) is seen to 
agree with (37.4). 

Fourier Analysis of the Radiation Field. The electromagnetic fields 
can be expressed in analogy with (37.5) : 

E(r,<)= r [E«(r)e-'“‘ H- c.c.]da, 

(37.7) 

H(r,^) = + c,c.]dco 

Because of the linear dependence of E and H on J through MaxwelVs 
equations, each of the Fourier components E« and H« can be obtained 
entirely from the corresponding component of the current density. 
The Poynting vector obtained from (37.7) is 

^E(r,0 X H(r,0 = ^ // // [E»(r) X H„-(r)e--(“-*-')‘ 

+ Eto(r) X + E„(r) X 

+ X (37.8) 

Equation (37.8) can be used to find the spectral energy distribution 
of the radiation. The distribution of the radiated energy in frequency is 
usually of more interest than the distribution in time, if the emission 
process occupies a time that is short by laboratory standards. We there- 
fore integrate (37.8) over t from — oo to + qo to obtain the total flow of 
energy per unit area. Since t appears only in the exponentials, we can 
use Eq. (11.10) to obtain 5 functions in the frequencies. Then when the 
integration over w' is carried through, the first two terms in the integrand 
of (37.8) contribute nothing, and the last two terms yield 

= ic ^ [Ea,(r) X H„(r) + E«(r) X H«(r)]dti) (37.9) 

The integrand of (37.9) is just twice the real part of Ea,(r) X Hw(r), and 
in accordance with (30.8) can be called (47r/6')P«(r). Since (37.9) has 
the form of an integral over frequency, we see that 27rPto(r)da) is the flow 
per unit area of energy that lies in the angular-frequency range w to 

CO + do3. 

We assume that the medium with which we are concerned is a homo- 
geneous isotropic dielectric that is characterized by a dielectric constant 
or specific inductive capacity e. This means that c is to be replaced by 
c/e* = c/n in Secs. 35 and 36, where the refractive index n of the medium 
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is equal to e*; also, k is veplaced by nk. Thus (36.11) becomes, in our 
present notation, 

U Jixjf) exp i-ink • x')dT' 

Here Pi^ is the component of P« in the direction of observation (parallel 
to k or r), and the magnitude of k has the earlier value w/c. 

Radiated Energy. The direction of observation is taken to have 
polar angles 6j<l> with respect to the z axis. Then k • r' in (37,10) is equal 
to k{x' sin ^ cos <#» + 1 /' sin ^ sin + z' cos d)j and Jxk,oj = sin 6, 
Substitution of (37.6) into (37.10) then gives for the energy flow per unit 
area and angular frequency 


(37.10) 


27 rP&«(r) 2ir 2^2^ ^2 sin^ e 


Uii2 

V 


ne^g?^ sin^ 6 


I I I S(x')S(y'}e 

, ^ink sin ^ coH <^+1/' nln 4+*^ oob 


{37.11) 


In order that (37.11) be definite, the particle can be considered as radiat- 
ing only over a finite length L of its path, so that the integral over 2 ' 
extends, say, from --J-L to -^L. 

The squared integral in (37.11) is easily evaluated to give 


j /I _ n cos g y 


(37.12) 


For large L, this has a sharp maximum when the denominator is zero, at 


cosi9o = - (37.13) 

nv 

Thus the radiation appears at a cone in the forward direction whose half 
angle decreases as rw/c decreases. Since ^0 becomes imaginary if ?; < c/n, 
there is no radiation if the condition (37.3) is violated ; in particular, there 
is no radiation if w = 1 (empty space), since v is always less then c. The 
fact that (37.12) does not quite vanish for cos 6 less than unity even if 
V <i c/n is due to the choice of a finite path length L; the starting and 
stopping of the charge at the ends of its path mean that the particle is 
accelerated, and this makes some radiation possible. 
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The total energy radiated involves an integral of (37.11) over the 
surface of a sphere of radius r. This can be evaluated like the integral in 

(29.11) , by making use of the sharpness of the maximum to extend the 
limits on cos 0 to ± oo and to replace sin^ 6 by sin^ obtained from 
(37.13). The integi’al of (37.12) is ^r^cLr^Jno), Substitution into 

(37.11) gives for the total energy radiated by the particle per unit angular- 
frequency range in the distance L 

(i)e^L sin^ 6q A c® \ 

7 ^ ^ V 

The number of quanta with angular frequency between w and co + dco 
emitted per unit distance by a particle of charge e moving with a speed v 
through a dielectric of refractive index n is then 

»7.i4) 

if 6 is the electronic charge. Thus the number of quanta emitted per 
unit frequency range depends on co only through n. For a very fast elec- 
tron {v ^c) passing through water (n ^ 1,33), Eq. (37.14) shows that 
there are about 230 quanta emitted per centimeter of path in the visible 
region (wave lengths from 4,000 to 7,600 A). 

Problems 

1. Show that if J = p « 0, the most general solution of MajcwelPs equations oan 
be expressed in terms of potentials such that div A «</>•= 0. 

2. Estimate the order of magnitude of pA/cp, when e is tho electronic charge, 
A is the magniUido of the vector potential for the visible part of the spectrum that 
corresponds to the radiation in a cavity at several thousand degrees centigrade, and 
p is the momentum of an electron in the first excited state of hydrogen. 

3. Verify Eq. (36.20) by means of the wave equation, without recourse to matrix 
methods. 

4. Show that the transition probability for spontaneous emission is equal to the 
transition probability for induced emission that would rcs\ilt from an isotropic field of 
such intensity that there is one quantum per state of the field in tho neighljorhood of 
the transition frequency. 

B. A hydrogen atom in its first excited (2P) state is placed in a cavity. At what 
Uunperature of the cavity are the transition probabilities for spontanoous and induced 
emission equal? 

6. What is the spontaneous transition probability per unit time, expressed in 
sec”^, of a hydrogen atom in its first excited state? 

7. Wliat is tho selection ruhi for allowed transitions of a linear harmonica oscillator? 
What is tho spontaneous transition probability per unit time,‘expniBsed in soc""^, of an 
oscillator in its first excited slate, when e, and co arc^ tho same as in Prob, 0? 

8 . Show that a logarithmic factor like that in Eq. (30.11) always appears in the 
cross section for excitation of a onc-cicctroii atom by electron impact, if tho corro^ 
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spending radiative transition is allowed. Derive the simplest relation that you oan 
between the differential cross section for excitation by electron impact and the corre- 
sponding transition probability for spontaneous emission, assuming the transition 
is allowed. 

9. Starting from Eq. (37.5), derive a general expression for t7‘*u(r) in terms of 
assuming that the latter is real. Use this to verify Eq. (37.6) when t/^(r,0 is 
given by (37,4). 

10. What is the polarization of the Cerenkov radiation? Is any angular momen- 
tum carried off by the radiation field in this case? 



CHAPTER XI 

ATOMS, MOLECULES, AKD ATOMIC NUCLEI 


This chapter is not intended to be a complete survey of the properties 
of atomic systems. It is primarily a presentation of a relatively few 
problems that arise in connection with the structure of matter, selected 
because they supply interesting and instructive applications of quantiun 
mechanics. These problems are grouped according to subject; enough 
explanatoiy material is included so that the treatment is coherent, 
although severely limited in scope. 

38. APPROXIMATIONS IN ATOMIC STRUCTURE^ 

The ground states of the two lightest atoms, hydrogen and helium, were 
considered in Secs. 16 and 27, respectively. Variation calculations simi- 
lar to those described for helium have been carried through for others of 
the light atoms. This section descril}es some of the approximations that 
have been used for the heavier atoms. The alkali atoms are discussed 
separately in Sec. 39. 

Central-field Approximation. The starting point of calculations on aU 
except the lightest atoms is the central-field approximation. This assumes 
that each of the atomic electrons moves in a spherically symmetric poten- 
tial energy y(r) that is produced by the nucleus and all the other elec- 
trons. The approximation is a good one if the deviation from the Y{r) 
for one ele(;trou produced by close passage of other electrons is relatively 
small. This is actually the case, since the constant nuclear potential is 
of the order of Z times as large as the fluctuating potential due to each 
nearby electron, and the latter varies quite slowly (inversely) with the 
separarion distance. The two principal problems are then the calcula- 
tion of the central field, and the coiTCction of the approximate results 
obtained from it. l^efore considering these problems, we discuss some 
general properties of the central field. 

The potential energy F(r) for a neutral atom has the Coulomb form 
—e^/r at a great distance r from the nucleus, since the removal of the 
electron whose potential is being measured leaves a singly charged positive 

^ For a moro detiiilod diHoiiBHion of tho material of this section and the next, see 
E. U. Condon and C. H. SliorU(‘y, “The Theory of Atomic Spectra*' (Cambridge, 
London, and Macmiilan, New York, 1935). 
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ion. The electron in the hydrogen atom, for which the potential energy 
is at all r, was shown in Sec. 16 to have an infinite number of 

bound energy levels characterized by the quantum numbers n, Z, and m. 
An infinite number of energy levels is also expected for 7(r), since for 
large n, the electron wave function is small near the nucleus, and only the 
form of V{r) for large r is significant. An important difference between 
the two situations is that the degeneracy between states of the same n 
and different I that occurs in hydrogen is removed in a non-Coulomb 
central field. This is because the electrons that have smaller angulai’ 
momentum penetrate closer to the nucleus, and V{r) is stronger (more 
negative) than —e^/r there, since the nucleus is less completely screened 
by the other electrons. Thus for given n, the states of lowest I have the 
lowest energy. The degeneracy with respect to m is not affected, since 
this occurs whenever the potential is spherically symmetric. 

Because of the spin, four quantum numbers n, Z, mz, and are 
required to specify the state of an electron in a central field. The orbital 
quantum numbers I and mi are the same as Z and m in the hydrogen atom, 
specifies the spin orientation, and ti is a natural generalization 
of the total quantum number that appears in hydrogen. Equation 
(16.14) shows that n — Z — 1 is the number of nodes of the radial part 
of the hydrogen wave function ; this definition of n is carried over to the 
general central field, so that Z does not exceed n — 1. 

Periodic System of the Elements. According to the Pauli exclusion 
principle (see the discussion of antisymmetric wave functions in Sec. 32), 
not more than one electron in an atom can have a particular set of values 
of the four quantum numbers given above. As Z increases, electrons fill 
the one-electron states of lowest energy in succession; the ground state 
of an atom in the central-field approximation is that in which there are 
no unfilled electron states that have lower energy than any that are 
occupied. Because of the degeneracy with respect to mi and there 
can be as many as 2(2Z + 1) electrons with the same energy in a shell 
that is specified by n and Z. It is apparent then that the ground-state 
configuration of the electrons in an atom can be described by specif 3 dng 
the number of electrons in each shell. In the central-field approximation, 
aU shells that contain any electrons are full except perhaps that which 
has the highest energy. 

The chemical properties of atoms are determined for the most part 
by the least tightly bound, or valence^ electrons, which are in the shell of 
highest energy. The most important factors are the number of occupied 
and unoccupied electron states in this shell, and the energy interval 
between this and the next higher (empty) shell. For example, an atom 
tends to be chemically inert if its highest shell is full and there is an 
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appreciable energy gap to the next higher shell, since then electrons are 
not readily shared with other atoms to form a molecule. The quasi- 
periodic recurrence of similar highest shell structures as Z increases is 
responsible for the periodic system of the chemical elements. 

In the usual spectroscopic notation, the n value of a shell is given as a 
number, the I value as a letter, and the number of electrons in the shell 
as a numerical supei’script. The letter code for I, and the maximum num- 
ber 2(21 + 1) of electrons in a shell, are as follows: 

Z = 0, 1, 2, 3, 4, 5, . . . 

Vi f) U) Zi, . . . 

2(2Z + 1) = 2, 6, 10, 14, 18, 22, . . . 

For example, the ground-state configurations of sodium {Z — 11) and of 
mercury (Z = 80) are 

Na: lsW2p«3s 

Hg : Is22s22p«3s23p®4fi23dio4p06s24di»5p86fi24/^^5dio 

The ground-state configurations of many of the elements can be 
written down simply from a knowledge of the order in which the energies 
of the shells increase. This order can be inferred from spectroscopic 
evidence, and is as follows : 

ls,2s,2p,3s,3p,[4s,3d],4p,[5s,4dZ],5p,[6s,4/,5cZ],6p,[75,6/,6d] 

The brackets enclose shells that have so nearly the same energy that they 
are not always filled in sequence. These shell energies are close together 
because the increase in n and the decrease in I tend to compensate each 
other; thus the 4s state, which has a higher energy than the 3cZ state in 
hydrogen, is depressed by the penetration caused by its low angular 
momentum. The s shell in each bracket is always filled first, although it 
can lose one or both of its electrons as the other shells in the bracket fill 
up. Apaidi from the brackets, there are no deviations from the indicated 
order of filling. 

Table 2 gives the ground-state configurations of each of the elements.^ 
An atom contains all the full shells that occur above and to the left of its 
position in the table. Since the number of 5 electrons varies as each d 
shell fills, the d columns are subdivided to show the number of s electrons. 

' This table is taken from Condon and Shortloy, op. ciL, p. 333. Rccont data and 
estimates on the rare (earths, the honviest natural elements, and the artificial trans- 
uranifi elements, neptunium (Np, 93), plutonium (Pu, 94), americium (Am, 95), and 
curium (Cm, 96), arc reviewed by W. F. MegRcra, Scieni'e^, 106, 614 (1947); the ele- 
ment names technetium (Tc, 43), astatine (At, 85), and fnineium (Fa, 87) are as 
given by O. T. Seaborg, Science, 106, 349 (1947). 
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Table 2. Ground-State Electron Configurations of the Elements 
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The two groups of atoms that appear to have a partially fuU f shell in 
their ground-state configurations fit in at * (rare earths) and at f (heaviest 
elements). The first group has the 6s shell full and the second has the 
7s shell full; the distribution of electrons in d and / shells for each group is 
shown below the main table. According to Meggers, the entries in the / 
groups that are enclosed in parentheses are extrapolations or predictions 
based on the analyses of neighboring spectra. 

A few of the periodicities are worth explicit mention. The first 
electron to go into each s shell beyond Is gives an alkali, and the elements 
just before each of these (full Is shell or a full p shell) are rare gases. The 
elements with the same number of electrons in a p shell have similar 
chemical properties; this is especially striking in the case of the halogens 
(one electron short of a full p shell). The elements with full 2s and 3s 
shells (Be and Mg) that are followed by p shells have somewhat different 
properties from the alkaline earths, which have full s shells followed by d 
or / shells. The filling of the 4s, 3d shells gives elements somewhat similar 
to those arising from the filling of the 5s, 4d shells. The elements that 
correspond to full bracketed shells (Zn, Cd, and Hg) are quite similar, as 
are the noble metals (Cu, Ag, and Au) in which an s electron is missing 
from the full bracketed shells. 

Thomas-Fermi Statistical Model. We now turn to the first of the 
problems associated with the central-field approximation. There are two 
methods that have been used for the determination of the potential 
energy 7(r). The first of these, due to Thomas^ and Fermi, ^ is discussed 
here, and the second, due to Hartree, is taken up later. The Thomas- 
Fermi statistical model assume*s that 7(0 varies slowly enough in an 
electron wave length so that many electrons can be localized within a 
volume over which the potential changes by a small fraction of itself. 
The electrons can then be treated by statistical mechanics, and obey the 
Fermi-Dirac statistics mentioned in Sec. 32. At normal temperatures, 
the thermal energy kT is very small in comparison with 7(r) eveiy where 
except at the edge of the atom, where the chance of finding an electron is 
small. In this case, the Fermi-Dirac statistics ro(iuiros that the electron 
states fill in order of increasing energy, tis assumed above. The difference 
between the present treatment and the more general discussion given 
earlier in this section lies in the additional assumption that 7(r) is 
sensibly constant over a region in which many electrons can be localized. 

The number of electron states in a cube of edge length L at the walls 
of which the wave functions obey periodic boundary conditions was 
computed in Sec. 11 to be {L/2ir)hik^ikydk,, This must be multiplied by 

iL. II. Thomas, Proc. Camb. Phil. Soc., 23, 642 (1927). 

2 E. Fermi, ZcUh. f. Phf/aikj 48, 73 (1928). 
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2 to take account of the two possible spin states; then the number of 
states for which the magnitude of the momentum p = Ak is less than or 
equal to po is 

Kk)T /.T is 

If all these states are occupied, the number of electrons per unit volume 
whose kinetic energy does not exceed Po/2m is Now the maxi- 

mum kinetic energy at any distance r from the nucleus is —1^(7’), since 
otherwise electrons would escape from the atom. We thus obtain a rela- 
tion between the volume density of electrons, n(r), and the potential 
energy 

n(r) = (38-1) 

The electrostatic potential V(r)/e is also determined by Poisson^s 
equation in terms of the charge density en{r) 

Equations (38.1) and (38.2) are two simultaneous equations for n and V. 
The boundary conditions on the solutions can be expressed in terms of V 
alone for a neutral atom of atomic number Z. As r — > 0, the leading term 
in the potential energy must be due to the nucleus, so that 7(r) — > 

As CO j there must be no net charge inside the sphere of radius r, 
so that V falls off more rapidly than 1/r, and rV{r) 0. The boundary 
condition at infinity is different from that assumed earlier in this section, 
where V was taken to have the asymptotic form —e^/r. The V discussed 
earlier is the potential experienced by one of the atomic electrons, while 
the Thomafl-Fermi potential is that experienced by an infinitesimal test 
charge. The difference between the two potentials emphasizes the 
statistical nature of the approximation made by Thomas and Fermi. 
The solution for V is exact in the limit in which m becomes infinite and 
e becomes zero in such a way that remains constant; then the elec- 
tron wave length becomes zero, and the density of particles becomes 
infinite. In this limit the potential is constant over many wave lengths, 
and enough particles are present so that statistical mechanics can be 
applied. 

Evaluation of the Potential. Elimination of n(r) from Eqs. (38.1) 
and (38.2) leads to an equation for *-F(r) 

LA \ = M"2m7(r)]8 

dr I dr J SttA* 


(38.3) 
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Equation (38.3) and the boundary conditions given above are conven- 
iently expressed in a dimensionless form in which Z, E, m, and h appear 
only in scale factors. We put 


^ X, r = hx 

’ = i A" ^ 0.885fflo 

^ 2 \ 4 / me^Z^ Zi 


(38.4) 


where ao = h^/me^. With thewe substitutions, (38,3) becomes 


® dx^ ^ 

X = 1 at o; == 0, X = 0 at a* 00 


(38.6) 


The most accurate solution of Eq, (38.6) was computed by Buish and 
CaldwelB with the help of the differential analyzer, and is expressed in 
the form of a numerical table. 

Equations (38.4) show that the radius” of an atom is inversely 
proportional to the cube root of the atomic number, if this radius is inter- 
preted to be that of a sphere that encloses a fixed fraction of all the elec- 
trons (see Prob. 1). These equations can also be used to show that the 
Thomos-Fermi approximation improves with increasing Z. The poten- 
tial at the atomic radius is proportional to Z*, so that a typical electron 
wave length is proportional to Z-“3, The distance over which the poten- 
tial changes by a definite fraction of itself is proportional to the atomic 
radius, or Z”*. Thus the fractional change of the potential in an electron 
wave length is proportional to Z~*, and decreases with increasing Z. 
Moreover, since the number of electrons is equal to Z, the use of the 
statistical method is better justified as Z increases. 

Hartree’s Self-consistent Fields. The sc^cond method for obtaining 
a central field is due to Ilartree.^ This model assumes that each electron 
moves in a central field that can be calculated from the nuclear potential 
and the wave functions of all the other electrons, by assuming that the 
charge density associated with an electron is e times its position prob- 
ability density. The Schrodingcr ecpuition is solved for each electron in 
its own central field, and the resulting wave functions made consistent 
with the fields from Avhicli they arc cahuilated. Thus the A’th electron is 
described by a normalized wave function Uhiji) that is a solution of the 
equation 


^ V. Bush and 8. II. (^aldwoll, Phy». Rvv., 38, 181)8 (11)31). 
2 D. H. Hartroo, Proc.. Carnb. Phil. Soc.^ 24, 111 (1928). 
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j^— ^ V| — "t" ^ y j “ **“*(^ 4 ) (38.6) 

js^h 

where r/i, = jr^ — r*]. If there are Z electrons in the atom, (38.6) con- 
stitutes a set of Z simultaneous nonlinear integrodifferential equations 
for the Z functions Ukitk)* It is therefore not feasible to solve these equa- 
tions directly, and Hartree used a method of successive approximations. 

A potential energy that approximately represents the second and 
third terms in (38.6) is assumed, electron wave functions computed, and 
new potentials for each electron found from these wave functions. This 
process is continued until the potentials are self-consistent to a high order 
of accuracy. The principal approximation made is the averaging of the 
potential energy given as the third term in (38.6) over the angles of 
rjb to make it spherically symmetric. The solutions of (38.6) can then be 
expressed as products of radial functions and spherical harmonics. A 
further simplification is made so that the 2(2Z + 1) or fewer electrons in a 
shell all move in the same potential and have the same radial wave 
function. 

It is apparent that the Hartree approximation neglects correlations 
between the positions of the electrons, since the entire wave function 
for all the electrons is assumed to be a simple product of one-electron 
functions 


^(ri>r2> - . . jfz) — Wi(r 1 ) 0 ^ 2 ( 12 ) ' * * Uzitz) (38.7) 

It is also clear from (38.7) that antisymmetrized wave functions are not 
employed. The antisymmetry is considered only in so far as the quan- 
tum numbers of the one-electron states are chosen in agreement with 
the exclusion principle. 

Connection with the Variation Method. We now show that the 
Hartree approximation results from an optimum variation calculation 
with the trial function (38.7).^ The wave equation with inclusion of 
interdectrordc interactions but neglect of spin-orbit terms (see below) is 


H\l/ = Exf/ 



(38.8) 


where j > k implies a double summation, over all different pail's of 
indices j and k. We wish to minimize the expectation value of 11. 


1 J. C. Slater, Phys. Rev., 35, 210 (1030) ; V. Fock, Zeita.f. Physik., 61, 126 (1930). 
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From (38.7) and (38.8) we obtain 


J ' ' ' J ■ • • drx 

= ^ / “*(*^*) (“ “ Tt) 

+ y V f f UjiXi)uk{^k) ^^Uj(ij)uk{Xk)dTjdTk (38.9) 

3>k 

since the m*, are normalized. The optimum ^ is obtained by varying 
each of the u* separately to minimize (38.9). The only dependence of 
(38.9) on a particular one-electron function Uk is through the terms 


J UkiXk) ^ ‘^k{tk)d7k 


UjiXi)ukiXk) — u,itj)Uk{Tk)dTfdTk 
Tjh 


Iff 

= j ilk{rk)Hktik(jk)dTk (38.10) 


The integral in (38.10) is the expectation value of the operator 77* for 
the function li*. From the discussion of Sec. 27, it follows that this is a 
minimum when Uk is an eigenfunction of Hk that corresponds to its low- 
est eigenvalue ej. 

HkUk = ekUk (38.11) 


Since Eqs. (38.11) and (38.6) are identical, wo see that the Hartree wave 
functions are the l>est from the point of view of the valuation method that 
can be w'ritten in the form (38.7). 

The energy associated with this wave function is just the integral 
(.38.9), whicili can be Avritten with the help of (38.6) 


/ ■/ 


^Ilxl/dri 


drz = ^ 

k 

-nif |?i,-(r,-)l“|Mt(rfc)|“ ^ dTjdrk (38.12) 


The electrostatic interaction terms between ehictrons are counted twice 
in the summation over and so have to l)c subtracted out to give (38.12). 
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Thus the enei’gy of the atom is not just the sum of the €*, although each 
eh is roughly the energy of removal of the A*th electron. This last is not 
strictly true, since the removal of an electron alters the self-consistent 
fields, and hence the wave functions and €^s for the remaining electrons. 
However, e* is found to be an especially good approximation to the energy 
of removal in the case of an inner electron (X-ray level). 

Corrections to the Central-field Approximation. We now turn to the 
second problem mentioned at the beginning of this section, the correction 
of the approximate results obtained from the central field. Two terms 
are omitted in the central-field approximation: the difference between the 
actual electrostatic interaction between electrons and the average inter- 
action that is included in the central field, and the spin-orbit energy. 
The latter is an interaction energy between the spin and the orbital 
motion of each electron, and has the form 

y f(r*)L, • Sfc (38.13) 

k 

Here, L* is the orbital-angular-momentum operator r* X Pa of the frth 
electron, and has the properties of the M operator introduced in Sec. 24; 
the eigenvalues of LJ and Lkz are given in terms of the quantum numbers 
I and nil for the Kh electron as + 1)^^ and respectively. is the 
spin angular momentum ^h6k of the A-th electron that was introduced in 
Sec. 33. The function ^(r) is given by^ 

- SWTT 

in terms of the central-field potential energy F(r). 

In considering the effects of these terms, we shall assume that the 
perturbed eigenfunctions, which are linear combinations of various con- 
figuration wave functions, have only negligibly small amounts of all but 
one configuration mixed in them. From Eq. (25.9), it is apparent that 
this is the case if the interconfiguration matrix elements of the perturba- 
tion are small in comparison with the energy intervals between unper- 
turbed configuration energies. 

It can be shown that the part of the summation in (38.13) that 
includes electrons in full shells is zero, since the function f is the same for 
all electrons in a shell and the contributions from electrons with opposite 
mi and cancel. Thus the electrons in full shells can be ignored and the 

^L. H. Thomas, Nalme, 117, 614 (1926). This energy is a consequence of rehi- 
tivity, and is derived as such in Chap. XII. It was first obtained from the procession 
of the spin axis of the electron, part of which is of electromagnetic origin (I^armor 
precession) and part of which comes from relativistic kinematics (Thomas precession). 
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summation extended only over the remaining electrons. The case in 
which there is just one electron outside full shells is of interest in connec- 
tion with the ground state and low excited states of the alkali atoms, and 
will be discussed in some detail in the next section. For the present, we 
consider very briefly the more general situation, always assuming that 
each atomic state is based on just one configuration of the electrons. 

LS Coupling Scheme. There are, in general, a number of states that 
belong to the same configuration, and that are degenerate in the central- 
field approximation. These states differ in the assignment of mj and 
quantum numbers to the individual electrons. The theory of complex 
spectra consists in determining the linear combinations of such suitably 
antisymmetrized wave functions that diagonalize the perturbation to 
first order (see Sec. 25), along with the corresponding perturbed energy 
levels. 

The most usual situation is that in which the hitherto-neglected 
electrostatic terms are larger than the spin-orbit energy; this is called the 
RusselUSaunders case.'^ States of the same configuration can be clas- 
sified as eigenfunctions of any dynamical variables that commute with the 
Hamiltonian and hence are constants of the motion (see Sec. 23). When 
all perturbations are included, the only true constants of the motion are 
the total parity and the total angular momentum J of the electrons 

J=L + S = 2(L*.-H-S*) (38.16) 

k 

J is a constant because the angles that specify the orientation of tho 
atom as a whole, and that are the canonically conjugate variables to the 
components of J, do not appear in the Hamiltonian of an isolated atom. 
When the electrostatic perturbation is included but the spin-orbit energy 
neglected, the same argument can be applied to show that the total orbital 
angular momentum L and the total spin angular momentum S are 
separately constants of the motion. Tho individual S* need not be 
constants, even though no spin-dependent forces act in this approxima- 
tion, since the use of antisymmetric wave functions couples the spias to 
the ele(d.rostaii(? energy (see the discussion of the excited states of helium 
in Sec. 33). 

A state call bo specified by the quantum numbers J, L, S, M, Mr,, and 
Ms, which are connoc^ted with eigenvalues of angular-momontiun opera- 
tors through 

P = J{J -f- l)h\ ./« = Mh 

L2 = L{L + l)/i^ L, = Mi.h 

- S{S + l)h\ Sr = Msh 

^ H. N. Russell and F. A. Saunders, Astrophys. Jour., 61, 38 (1926). 


(38.16) 
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When, the spin-orbit energy is neglected, the electruntalic energy separates 
states of different L; in some cases, only particular S values are permitted 
because of the exclusion principle. Only two of the other four quantum 
numbers are independent, so we can use either LSM lM or LSJM to 
specify a state. Because of the spherical symmetry of the Hamiltonian 
with respect to its space and spin parts separately, the energy is inde- 
pendent of the directional quantum numbers Ml and and tlicre are 
(2L + 1)(2/S + 1) degenerate states. For given L and S, the states 
specified by J and M are linear combinations of those specified hy Ml 
and Msj so that the same amount of degeneracy appeal’s in tho LSJM 
representation. This is called the LS coupling scheme, wince th(3 indi- 
vidual Lfc are coupled together to form the total L, and the individual Rk 
to form the total S. 

If now the spin-orbit energy is included, L and S are no longer con- 
stants of the motion, although J and M still are. However, we tiwsiimo 
that states of different L and ;S are sufficiently well separated by tho 
electrostatic energy so that their mixing due to spin-orbit energy can bo 
neglected. This is analogous to the earlier assumption that dilTcrenfc 
configurations are sufficiently well separated by the central field so that 
their mixing due to the electrostatic energy can be neglected. States of 
different J in the LSJM representation are now split apart by tho spin- 
orbit energy; the energy is still independent of ilf , so tliat there are 2./ + I 
degenerate states. A Russell-Saunders state is . usually wi'itton in the 
form where the superscript is the m.uUiplirUij 2S + 1, tho h^l.tor 
(now capitalized) is theL value according to the (^()d(^ given earlier in this 
section, and the subscript is the J value; in this case S = §, L = 2, anil 
/ = i. Since J = L + S, the argument at the end of Hoe. 24 shows thal. 
J can only be one of the numbers L + S, L + S — ] , . . , , “ *^1- 

Selection Rules, The selection rules in the llussell-Saundors ease 
can be obtained from the discussion of Sec. 37. Only one clootron is 
involved in a transition, so that the configuration cliangew through ii 
change in one of the Ts by one unit; this also chang(‘s the parity. Hince 
the electric-dipole moment does not involve tho spins, and the spin func- 
tions for different 8 are orthogonal (see Prob. 4), S do(\s not chtingi^ in an 
allowed transition. The conservation of angular momoniuin l)^d.^v^H*u 
atom and radiation field further requires that J and L oac;li churigi? hy 
1 or 0. Transitions between states both of which have / = 0 are strictly 
forbidden. 

Intersystem lines that join states of different multiplicity (chungo in 
S) sometimes occur, and indicate a partial breakdow n of LS coiii’jling. 
The very intense mercury resonance line at 2,537 A iw an intorsystnm 
line: -> ^So. This transition is allowed so far aw tlie changes in ./, L, 
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configuration and parity are concerned, but not as regards the change in 
iS. The ®Pi state is partially mixed by the spin-orbit energy with a 
higher singlet {S = 0) state of the same J and parity, and this makes an 
electric-dipole transition possible. 

jj Coupling Scheme. The opposite approximation to that involved 
in LS coupling assumes that the spin-orbit energy is large in comparison 
with the electrostatic energy. If the latter is neglected, each electron 
can be characterized by the quantum numbers nljm rather than nZmjma, 
where (L* + SkY = j{j + l)h^ and Lkz + Skz = The electrostatic 
energy?' then splits apart states of different J. 

This is called the jj coupling scheme, since the orbital and spin angular 
momenta of the individual electrons are coupled together to form j’s, 
from which the states are built up. It is mainly of interest in heavy 
atoms, where the large V{r) makes the spin-orbit energy (38.13) the 
dominant perturbation. 


39. THE ALKALI ATOMS 

The ground-state configuration of an alkali atom consists of a series 
of full shells followed by a single s electron, and so is The inner rare- 
gas configuration is so stable that all but (piite high excited states of the 
atom involve only the valence electron. Thus the alkalis can be treated 
to quite good approximation in terms of a model in wliich a single electron 
moves in a spherically symmetric non-Coulomb potential energy F(r). 
In this section we calculate the energy levels and the intensities of allowed 
transil-ions in the al)sen(*.e and presence of an external magnetic field. 

Doublet Separation. The configuration of an alkali atom can be 
specified by a singles pair of ciiiantiim numbers nL Since there is only 
one electron, the pcM’turbing elcu'-trostatic term num tinned in the last sec- 
tion does noi. apix'ar. In tlie absencx^ of external fieltls the Hamiltonian, 
including the spin-orbit energy (38.13), is 

// = - + Vir) + ?(r)L • S (30.1) 

where f(r) is given by (38.14). As in Sec. 38, wo neglect the mixing of 
different configurations produced by the spin-orl)ii- encM’gy, and regard 
this term as a perl urbation that rtunoves the wurna degenera(;y within each 
configuration. The total angular moment, uin J = L -|- S of the valence 
electron is a c.onsi.ant of the motion (sc^e Prob. 5), so that sl.ates (;an be 
designated by jm instead of wlux’e = ;/(;/ + 1)A“ and ,/g = 7nh. 

The states of different^’ liavc different energies, l)ut there is still a (2;/ + l)- 
fold d(;generacy due to m. The removal of th(^ vi degeneracy by a mag- 
netic field is discussed later in this section. 
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The difference in energy between states of different j is due to the 
L * S term in (39.1), and can be found from its expectation value or 
diagonal matrix element [see Eq. (26.8)]. We have the operator relation 


P = (L + S)2 = L2 + S2 + 2 L • S (39.2) 

Since Z, j, and s are all good quantum numbers (s = -y for one electron), 
Eq. (39.2) can be solved for the diagonal matrix element of L * S 

(L ■ S)^,.z; = mi + 1) “ + 1) ~ (39.3) 

Now if Z is different from 0, j can be either Z + or Z — Thus the 
first-order perturbation arising from f (O^^ ’ S is 


iZfnz if i = Z + 

“^•(Z + l)fnz if i = ^ i 

f»: = A* >„z(r)|*f(r)r=dr, I > 0 


(39.4) 


where Rvx{t) is the normalized radial part of the unperturbed eigenfunc- 
tion associated \vith the nZ configuration. Since y’(r) represents an 
attractive potential energy, £(r) given by (38.14) is positive and is 
positive. Thus (39.4) shows that the state with higher j has the higlier 
energy. The pair of states is called a do^Met; the doublet structure 
characterizes all the moderately excited levels of the alkali atoms except 
those for which Z = 0, in which case j can only be 

The doublet separations can be calculated from (39.4) if the radial 
function is known. We can get a rough estimate of their dependence on 
n by using the hydrogenic wave functions given in Eq. (16.24), and assum- 
ing that F(r) has the Coulomb form —Ze^/r. Substitution into (38.14) 
and (39.4) gives, with the help of the generating function (16.21) for the 
associated Laguerre polynomials, 


^ _ h^Ze^ 

2mvjo 

c-WZ^ 

” + i)(l +T) 

This is valid only for Z > 0; the singularity in K^) at r = 0 makes the 
integral for fno diverge there, so that the perturbation approximation in 
not valid. It follows from (39.4) and (39.5) that the doublet separation is 
proportional to n“*, and this is in fair agreement with observation. The 
absolute value of the doublet separation and its dependence on Z are not 
given at all by this simple theory, since the effective Z is difficult to 
estimate and depends markedly on Z because of penetration,^ 


^ The effect of the spin-orbit interaction on the energy levels of hydrogen is fo\md 
in the next chapter aa part of an exact relativistic calculation. 
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Doublet Intensity. We now calculate the relative intensities of the 
two lines of the allowed doublet and under the 

assumption that the radial wave functions are the same for the two excited 
^P states. Transitions of this type give rise to the principal series in the 
alkali spectra.- From Eq. (36.22), the spontaneous transition probabili- 
ties, and hence the observed intensities if the two P states are equally 
likely to be occupied, are proportional to the squares of the dipole matrix 
elements.^ 

The dependence of the two excited ^P states and the ground state 
on the angular and spin coordinates of the electron is obtained by finding 
linear combinations of products of the four spherical harmonics Yi,i{d,<l}), 
Yi,Q{6,<j)), Fi.-i(0,</>) and Yo,o{9,<l)), and the two spin wave functions (+) 
and (— ), that are eigenfunctions of J® and J* (see Secs. 14 and 33 for dis- 
cussion of the angle and spin functions). These combinations can be 
obtained from the spin functions for three electrons given at the end of 
Sec. 33. We replace Si by S, S 2 + Ss by L, and the spin functions (33.6) 
for the second and third electrons by the spherical harmonics Fi,i, 
71,0, 7i.-i, and 7o,o, respectively (see Prob. 3, Chap. IX). This gives at 


once 


/ m 
2Pj j ^ 


■ff 




m 




(+)7i,i 

3-i[2K+)J^M + (-)7i.J 
3-H2K“)I"m + (+)7w] 

3-*[(+)7i.o - 2i(-)7M] 
3-i[(-)7i.o -2K+)7i._i] 
(+)I"o.o 


(39.6) 


The wave functions (39.6) can be used to calculate the matrix ele- 
ments of .r = r sin B q.q^ 4>, y — r sin 6 sin <^, and ^ = r cos 6, We assume 
that the radial functions associated with (39.6) are all the same, so that 
the radial part of the matrix-element integral is a common factor through- 
out. The angle parts of the integrals are easily evaluated by making use 
of the explicit expressions for the 7^s in terms of 9 and <t> given in (14.16).® 
The products of spin functions follow the simple rules (+)*(+) = 1, 
(■“)*(+) = Oj Gtc. In this way we obtain the following values for the 
squares of the magnitudes of the indicated matrix elements, expressed in 
units of tV of the common radial factor: 

^ The energy difference between the two upper states is so small that the factor 
in (36.22) does not affect the intensity ratio appreciably. 

* In the general case in which Tim's with I > 1 arc involved, it is often easier to 
use a formula for the integral of the product of three spherical harmonics given by 
J. A. Gaunt; see Condon and Shortley, op. cit., p. 176. 
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»Pi -> 


' m = ■§ to m = 

i 

= \y\^ = 3, 

= 0 

1 1 to 

-i 

= W = N'-* 

= 0 

1 i to 


|a;|a = \y\i = 0, 

= 4 

, i to 


la-ji! == jj,j2 

= 0 

1 m = -^ to m = 


\x\^ = \y\^ = 0, 

la|» = 2 

1 ito 


1^-r = w = 2, 

= 0 


(39.7) 


Similar results are obtained for the transitions that start from m = — ^ 
and — I; altogether, they confirm the m selection rules of Sec. 37. 

It follows from (39.7) that the sum of the intensities of all the lines 
that originate on each of the four states is equal to 6, in the above 
units. It is to be expected that these sums are equal since the four values 
of m differ only in the orientation of the angular momentum, and this 
should not affect the intensity. However, the total intensity from each 
of the two states is also equal to 6. The equality of total intensities 
from each state formed from a given L and ^ is a general property of LS 
coupling; this makes the observed intensity, which is that from all the 
states that are degenerate with respect to m, proportional to 2 J + 1.^ 
In the example considered here, the two lines of the doublet have inten- 
sities in the ratio 2:1. This is observed for the lowest doublets of the 
alkalis, although for the higher doublets, the intensity ratio exceeds 2. 
This is because the spin-orbit energy actually mixes different configura- 
tions (^P states with the same j but different n) ; the amount of mixing is 
different for the two j values, so that the two radial functions are not the 
same. A small admixture of the low-intensity upper states in the high- 
intensity lowest ®P states has little effect, whereas in the opposite case, 
there is a large effect on the doublet intensity ratio. ^ 

Effect of a Magnetic Field*® We now consider the effect of a magnetic 
field on the energy levels and transition intensities of an alkali atom. A 
constant magnetic field H can be represented by the vector potential 


A = (iH X r) (39.8) 

since H = curl A. The divergence of (39.8) is zero, so that the terms 
involving A that appear in the Hamiltonian (23.24) are 




— A • grad + . 

me ^ 2mc^ 






2mc ^ ^ 8mc''“ 


sin^ B 


(39.9) 


^ Condon and Shortley, op. cit.j p. 238. 

2 E. Fermi, Zdts.f. Fhymky 69 , 680 (1929). 

'' W. Heisenberg and P. Jordan, Zeils. f. Physik., 37 , 263 (1926). 
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where L = r X p and 6 is the angle between r and H ; c is the electronic 
charge, and hence is a negative number. 

The electron also has an intrinsic magnetic moment in the direction of 
its spin axis. The magnitude of this moment can be determined from 
comparison between experiment and the theory of the Zeeman effect 
presented below, and is in agreement with the value deduced from Dirac^s 
relativistic theory of the electron (see Chap. XII) ; it is ch/2mc, or e/mc 
times the spin angular momentum of the electron. This is twice the 
ratio of magnetic moment to angular momentum of a classical charge 
distribution for which the ratio of charge to mass density is constant. 
The magnetic moment is (e/mc)S, and the extra energy in a magnetic 
field is 

- — H • S (39.10) 

m-c ^ 

The ratio of (39,9) to the kinetic energy is quite small for magnetic 
field strengths commonly attainable in the laboratory (see Prob. 7). It is 
therefore permissible to use perturbation theory to find the effect of the 
H terms on the wave functions and energy levels. In most cases, only 
the linear terms need be considered. However, for very strong fields 
and large orbits, the quadratic terms can become of interest (see the dis- 
cussion of the quadratic Zeeman effect below). Also, the diamagnetic 
susceptibility can be obtained from the tenns in the energy that are 
proportional to H®. 

Weak-field Case. For the present, we consider only the first-order 
effects of H. The Hamiltonian (39.1) then be(iomes, with (39.9) and 
(39.10), 

^ ® ^ 

where the field is along the z axis. The magnetic field can now be clas- 
sified as weak or strong according as the last term in (39.11) is small or 
large in comparison with the spin-orbit energy. Ihc Zrrman effect usu- 
ally refei-s to the weak-field case, and the Pa^chen-Bajck effect to the 
strong-field case, although the term Zeeman effect is sometimes used to 
include all magnetic effects. 

In the weak-field (^ase, wo can make use of the wave functions (39.()), 
which are eigenfunctions of J* and ,/,. It is easily verified that the 
magnetic energy + 2 aS,) = 6(./g + aS.) lias matrix elements beiAveen 
states of different j, but not between states of the same j and diffcM'ont vi. 
We neglect the former, be(;ause of the relal/ivt^ly large energy separation 
between states of different Thus the inagnnlh; energy is diagonal with 
respect to m for each and shifts tlie (nicrgy of eac.h of ilie states (39.9) 
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by its expectation value for that state. In each case, is diagonal, so 
its expectation value is mft. The expectation value of S, for the *P| 
state with m = i, for example, is 

;;3-i[2K+)*?i.o + (-)*?u]i^<r.3-i[2»(+) Yi,o + (-)Fi,i] sin dded<l, 

= [aK+y^^'i.o + (-)*?i.i][2K+)no - sin eded<i> 

= ^( 2 - 1 )=^ 

with the help of (33.5) and the orthonormality of the spin functions and 
the Y'b. Thus the magnetic energy of this state is €h{^ + i) = 

This and the similar results for the other states (39.6) can be expressed in 
terms of the LandS g factor; the magnetic energy is 

ehmg poq io^ 

^ = i for ^ 1 for ^ = 2 for ^ ^ 

The weak-field transition intensities are given directly by (39.7). 
According to the discussion of Sec. 37, the radiation from the transitions 
in which m changes by unity is circularly polarized when viewed along 
the field and linearly polarized perpendicular to the field when viewed in 
the xy plane; these are called the a components (from the German sen’k'- 
recht). When m does not change in a transition, the radiation does not 
appear along the field, and is polarized parallel to the field (tt components) 
when viewed in the xy plane. For observation in the xy plane, the ir 
intensity is proportional to in (39.7) and the a- intensity is proportional 
to either \x\^ or \y\^ (but not to their sum). 

Strong-field Case. If the magnetic energy is large in comparison 
with the spin-orbit energy in (39.11), the field is said to be strong. In 
this case the states within a given nl configuration are better specified by 
mi and than by j and m as in (39.6). The magnetic energy is then 
diagonal and has the value 

€h(mi + 2m,) (39.13) 

If the spin-orbit -energy is neglected for the moment, the eight wave func- 
tions that correspond to (39.6), and their energy shifts (39.13), are 

(+)Fu 2eh 
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In the event that the magnetic field is very strong, the spin-orbit 
energy is most simply treated as a perturbation on the wave functions 
(39.14). We consider instead the general case, which includes all relative 
magnitudes of the magnetic and spin-orbit energies. This is done by 
working with the matrix of the last two terms in (39.11) in either of 
the representations (39.6) or (39.14). The eigenvalues of the matrix are 
the energy levels, and the transformation that diagonalizes the matrix 
gives the wave functions in accordance with the discussion of Sec. 22. 
We start from (39.14), and notice at once that the two wave functions 
are the same as the functions of (39.6). We ignore the effect of the 
spin-orbit energy on these two states, since it does not shift them with 
respect to each other; the energy shifts due to the magnetic field are 
±6^. Similarly, the first and last of the six wave functions are the 
same as the functions of (39.6) with m = if; their energies are 

if ± 2eh, where f is given by (39.4). 

The four remaining wave functions combine in pairs, according to 
whether m = + m* is equal to i or ~ i. It is enough to consider 

just one of these pairs, say that for which m = i: (+)yi,o and ( — )yi,i. 
The matrix of the magnetic and spin-orbit energies in the representation 
specified by these two states can be found with the help of the angular- 
momentum matrices (24.15). 


/ 2-4f\ 

\2-*f -if/ 


(39.15) 


In accordance with the discussion of Eq. (21.19), the eigenvalues of the 
matrix (39.15) are found by solving the secular equation 

left - X 2- if 


2- if 


-if -X 


= X^ + (if — €^)X — if(€^ + f) = 0 


In this way we obtain for the energy shifts of these two states 


X± = i[6ft - if ± + eAf + ff^)i] (39.16) 

In the weak- and strong-field limits, the upper and lower signs in (39.16) 
lead to 


X+— >^if + lefi and 
X+ eh and 


X„ — > — f + i€^ for — — ► 0 

^ (39.17) 

X--^ - if for-^-^0 


Equations (39.17) show that the state that corresponds to the upper sign 
in (39.16) is the weak-field state i = f , m = i and the strong-field state 
nil = 0, TTifl = i. Similarly, the lower sign in (39.16) corresponds to the 
weak-field state J = i, m = i and to the strong field state mz = 1, 
m. = - i. 
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The transition intensities can be found in the general case by calcula- 
ting the matrix eleinents of x, y, and z with the help of the eigenfunctions 
of {(r)!'* S -h e{Lg + 2Sg). These eigenfunctions are the first, sixth, 
seventh, and eighth of (39.14), and linear combinations of the other four 
functions that are obtained from the matrix that diagonalizes (39.16). 

Quadratic Zeeman Effect. For very strong ma^etic fields and large 
orbits or n values, effects of second order in H become appreciable. From 
(39,6), it is apparent that the effect of the spin-orbit energy becomes very 
small for large n, and a useful approximation is obtained by neglecting 
this part of the energy entirely. In this case the electron spin commutes 
with the Hamiltonian, so that m, is a constant of the motion, and the spin 
can be ignored. The Hamiltonian (39.11) is then replaced by 

H = - ^ V* + F(r) + eL. + sin* 8 (39.18) 

Since L» — — ihd/d<t> commutes with (39.18), mi is a good quantum 
number, and the only effect of the term tL, is to displace each energy level 
by the amount ^hmi. Thus for large n, we need only be concerned with 
the effect of the last term sin^ 6 in (39.18), for particular 

values of nii and m*.^ 

It follows from the work of Sec. 16 that the effective radius of a hydro- 
gen atom is roughly proportional to n^. For alkali-atom states of large 
n, y(r) has practically the Coulomb form, and the wave functions are 
very nearly hydrogenic functions. Thus H' increases about as n^. This 
means that n is no longer a good quantum number for sufficiently large n. 
For smaller n, I may not be a good quantum number since H' has off- 
diagonal matrix elements between states of different Z, and the unper- 
turbed energies of these states lie close together (they fail to be degenerate 
only because the wave functions for the smallest values of I penetrate 
the inner full shells). In this region, the perturbed energy levels can be 
found by diagonalizing the matrix of H' for given values of n, mi, and 
m,, when the n — |mi| rows and columns are labeled by L The structure 
of the H' matrix can be inferred from Gaunt’s formula (footnote 2, page 
281) ; since sin^ 6 can be expressed in terms of spherical harmonics of order 
0 and 2, the only non vanishing matrix elements Hhf are for Z — V = 0, ± 2. 
Thus the W matrix has the form (if for example, mi = 0) 

0 0 0 
0 ff'u 0 0 

Hio 0 0 

0 0 H'33 0 

0 0 Hia 0 H'44 

^ L. I. Schiff and H. Snyder, Phya, Rev., 66, 69 (1939). 
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The matrix (39.19) is equivalent to two independent matrices, one 
for even and the other for odd I, each with about ^ rows and columns. 
Direct diagonalization of these would be quite arduous for large n. How- 
ever, the resulting energy levels are so close together that they cannot 
be resolved spectroscopically, so there is little reason to determine the 
individual levels. What can be observed is the aggregate of transitions 
between the ^8 ground state (I = 0) and the group of states that are 
obtained by diagonalization of (39.19); these appear to be a single 
broadened ^^ine.” Allowed transitions occur only by virtue of the state 
with I = 1 that is mixed into each of the eigenfunctions of (39.19), so 
that mi can be only 0 or ±1. This makes it possible to find the center of 
gravity of this line and its mean-square breadth without diagonalizing 
H', as we now show. 

The unperturbed wave functions can be chosen so that is a real 
matrix. Then the unitary matrix 8 that diagonalizes H' can be real, so 
that Eq. (21.20) or Eq. (22.6) can be written 

8WS^ = E (39.20) 

where E is diagonal. In terms of matrix elements, this equation is 

8ikHki^ii = EiSii 

The new eigenfunctions Ui that correspond to the energy eigenvalues Ei 
are given in terms of the unperturbed wave functions Vi by (22.3) 



If now we neglect the dependence of the radiative transition probability 
on energy over the small range of energies involved in this group of 
states, the transition probability is proportional to the square of the 
amount of i>i in each Ui, or to Tims the energy levels Ei should bo 
weighted in proportion to aSiJ. The center of gravity of the group of 
perturbed energy levels is given by 

i?uv.. = 2 = H'n 

i 

since (39.20) can bo inverted to give 11' = S*ES, In similar fashion, 
the mean-square breadth of the line is 

2 {JUi - 

i 

= y ii[\ - Ei^, = H[i 

T 


I 



288 


QVANTVM MECHANICS 


[Chap, XI 


Thus only two of the matrix elements of W need be calculated. It is 
apparent that both the displacement (apart from the factor €hmi) and 
the breadth of the line are proportional to 

40. MOLECULES 

Molecules are considerably more complex in stmcture than atoms, 
and correspondingly less has been accomplished in the quantitative 
application of quantum mechanics to molecular problems. In this sec- 
tion, a general accoimt of the nature of molecular energy levels is followed 
by a simple explicit calculation for the hydrogen molecule and a somewhat 
more general treatment of diatomic molecules.^ 

Classification of Energy Levels. The simplifying feature that is the 
basis of all molecular approximations is the large ratio of nuclear mass to 
electron mass. As we shall see shortly, this implies that the energy 
associated with the motion of the nuclei is much smaller than that associ- 
ated with the motion of the electrons about the nuclei. Since the period 
of a motion is of the order of h divided by its energy, the nuclear periods 
are correspondingly longer than the electronic periods. It is then a good 
approximation to regard the nuclei as fixed in calculating the electronic 
motion. Moreover, the nuclear motion can be calculated under the 
assumption that the electrons have their steady motion for each instan- 
taneous arrangement of the nuclei '(adiabatic approximation). 

The nuclei are expected to have a stable equilibrium arrangement 
somewhere between a completely collapsed structure (which is unstable, 
since the nuclei are positively charged and repel each other at short dis- 
tances) and a completely dispersed structure (which is not the most stable 
structure if a molecule exists). The nuclear motions can then be classified 
into translations and rotations of the quasi-rigid equilibrium arrange- 
ment, and internal vibrations of the nuclei about equilibrium. As with 
atoms, the translational motion is the same as that of a free particle 
[see the discussion of Eqs. (16.5)], and gives rise to no nonclassical 
features. 

We thus arrive at a classification of molecular energy levels into 
electronicy vibrational, and rotational types, and proceed to estimate their 
relative orders of magnitude. Suppose that the molecule has linear 
dimensions of order a. Then the energy associated with the motion 
of a valence electron (one that occupies roughly the whole of the molecular 
volume, rather than one that is bound in an inner shell close to a nucleus) 

^ For more dotailod discussions, see G. Herzberg, “Molecular Spectra and Molec- 
ular Structure” (Prentice-Hall, New York, 1939); L. Pauling and E. B. Wilson, Jr., 
“Introduction to Quantum Mechanics,” Chaps. X, XII, and XTII (McGraw-Hill, 
New York, 1935). 



Sec. 40] 


ATOMS, MOLECULES, AND ATOMIC NUCLEI 


289 


is of order where m is the electronic mass. This can be inferred 

by an argument like that given near the beginning of Sec. 9, by noting 
that the momentum uncertainty of the electron is at least of order A/a, so 
that its minimum kinetic energy is We thus obtain 


Eg 


ma^ 


(40.1) 


For values of a of the order of a few angstrom units, this corresponds 
to transition frequencies in the visible and ultraviolet regions of the 
spectrum. 

To estimate the vibrational energy, we regard each of the normal 
modes as a harmonic oscillator with which is associated a mass M and a 
stiffness constant Ko. M will be of the order of a typical nuclear mass. 
Kq can be estimated by noting that a displacement along a normal mode 
by the order of the molecular size a must produce an energy change of 
the order of the electronic energy Ee, since such a large displacement 
would produce a substantial distortion of the electronic wave function; 
we thus put Kq Ee/a^. Then the energy associated with a fairly 
low mode of vibration is, from (13.8), 


where use has been made of (40. 1) . E„ is roughly a hundred times smaller 
than Ee, and corresponds to transitions in the near infrared. 

The rotational energy Er can be estimated from the moment of inertia 
of the molecule, which is of order Afa^. As would be expected, the 
angular momentum of a fairly low mode of rotation turns out to be of 
order A, so that 


This is about a hundred times smaller than and corresponds to transi- 
tions in the far infrared. 

It might be expected from Eqs. (40.2) and (40.3) that the electronic, 
vibrational, and rotational energy levels can be obtained as successively 
higher orders in an approximation that is based in some way on the small 
ratio m/M (which is usually in the range 10“® to 10""^). This was shown 
to be the case by Bom and Oppenheimer.^ They used as expansion 
parameter the ratio of a typical nuclear vibrational displacement to the 
internuclear distance (which is of order a). An oscillator of energy 


^ M. Born and J. R. Opponheimer, Ann. d. Physik, 84, 467 (1927), 
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E^ and stiffness constant Kq has a displacement of order 



so that the expansion parameter is 

In terms of this, the electronic energy is of zero order, the vibrational 
energy of second order, and the rotational energy of fourth order; the 
first- and third-order energies vanish. 

Wave Equation. The time-independent Schrodinger equation for a 
molecule is readily written down 

i = l j-p-l 

There are n electrons and N nuclei, and V is the sum of the electrostatic 
interactions between all pairs of them. It is apparent that the nucleai* 
kinetic-energy terms are of fourth order in the expansion parameter 
(40.4). If they are neglected, the wave function ^ involves the nuclear 
coordinates R,- only parametrically, and (40.6) is a wave equation in 
the ti for the motion of the electrons with respect to nuclei that are fixed 
in space. In this case, yj/ is approximately WRy(ri) and corresponds to 
the energy eigenvalue t/(Rj). The nuclear motion can then be found by 
regarding i7(R/) as a potential function and using it to obtain a nuclear 
wave function ^^^(R^). 

We therefore write ^ in the form 

\^(ri,R/) = (40.6) 

where u satisfies the equation 

(- ^ V? + uM = C/(R,-)Mii,(rO (40.7) 

1 

For each arrangement of the nuclei, i7(Rd is obtained as an eigenvalue 
of Eq. (40.7). There wiU, in general, be several solutions that corre- 
spond to different electronic states of the molecule; care must be taken to 
ensure that u and U change continuously with R,-, especially if the system 
is degenerate. Substitution of (40.6) into (40.5) gives, with the help of 
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(40.7), 

i-l 

which can be rewritten 

N 

UR,(r,) [- ^ + C7(R,) - .e] w(R^ 

J-1 

N 

= ^ [«)(Ri)V|MR,(r{) + 2 grad, w(R,) • grad, UR,(r,)] (40.8) 

If now the dependence of w on Rf is neglected, the right side of (40,8) 
drops out, and an approximate wave equation for the nuclear motion is 
obtained: 

V 

[ - V v?+ t/(R,)] «)(R^) = Sio(R,) (40.9) 

jTi 

The neglect of the grad,- u terms derives physically from the smallness of 
the amplitudes of the nuclear motion in comparison with the equilibrium 
internuclear distances [smallness of the expansion parameter (40.4)]; 
this implies that the electronic part u of the wave function does not change 
much as the nuclei move. Born and Oppenheimer have shown formally 
that this approximation is justified so long as not too high vibrational 
and rotational modes are excited. 

The Hydrogen Molecule. It is clear from the foregoing discussion 
that two distinct problems arise in connection with molecular structure. 
The fii^t is the solution of Eq. (40.7) to obtain electronic wave functions 
and a potential energy function [^(R,) of the nuclear coordinates. The 
second is the solution of (40.9) for the nuclear motion. The first prob- 
lem can be solved only in the simplest ciuscs. As an example, we now 
consider in outline an approximate solution for the hydrogen molecule 
due to Heitler and Tjondon.^ Following this, we discuss the solution of 
Eq. (40.9) for a general diatomic molecule, making simple assumptions 
concerning the potential energy U, 

The only nuclear (mordiriate R/ that appears in (40.7) in the case of 
the hydrogen molecule is the magnitude H of the distance between the 
two hydrogen nu(4ei. The Hamiltonian is that given in Eq. (27.11) 


^ W. Hoitlor and K. Foiidon, ZailH.J. Physikt 44, <165 (1027). 
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(see Fig, 24) ; however, Rmno longer large in comparison with 



so that the approximations implied in (27.11) and (27.12) are no longer 
useful. Nevertheless, an approximate wave function based on a simple 
product of two ground-state hydrogen-atom functions gives remarkably 
good results. The reason for this is that exchange degeneracy (see Sec. 
32) is taken into account: the degenerate wave functions for which elec- 
tron 1 is on nucleus A and electron 2 on nucleus B, and for which electron 
1 is on nucleus B and electron 2 on nucleus Aj are both used at once. 
The new feature of the work of Heitler and London was the recognition 
that an appropriate linear combination of unperturbed degenerate wave 
functions gives a significantly lower energy than the separate wave 
functions; it is the basis of the present-day theory of homopolar binding m 
molecules. This property of degeneracy is sometimes referred to as 
resonance. Just as an interaction between two classical oscillators that 
are in resonance (same unperturbed frequency) gives rise to a normal 
mode that has a lower frequency (and also one that has a higher fre- 
quency), so does an interaction between two resonant (degenerate) 
states in quantum mechanics give rise to a lower energy eigenvalue (as 
well as to a higher one).^ There may of course be more than two degen- 
erate unperturbed states, and the degeneracy need not be of the exchange 
type. 

Potential-energy Function. Equation (40.7) for the hydrogen mole- 
cule is 


[H - - 0 

H = - ~ (yi + VI) + + - L _ J_ 

2772. \R ri2 ?’iA Tofi 


rui T2 a) 


We wish to base our approximate calculation of V(li) on the approximate 
wave functions 


2Zi(ri,r2) = UA{ti)uB{x2) 
W2(ri,r2) = UA{r2)uB{xx) 


(40.11) 


where Ua and Ub are ground-state hydrogen wave functions [wioo in the 
notation of (16.24)] based on nuclei A and respectively. It must 
first be noticed that the U\ and of (40.11) are eigenfunctions of different 
unperturbed Hamiltonians, so that the degenerate perturbation theory 
of Sec. 25 is not applicable. This is unlike the situation with the helium 


^ This use of the word resonance is only remotely related to that which appeared in 
Sec. 19 in connection with scattering. 
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atom [see the discussion of (33.7)], where the two exchange-degenerate 
wave functions are solutions of the same unperturbed Hamiltonian. 

We can, however, use the variation method of Sec. 27, in which case 
it is natural to adopt as the trial function an arbitrary linear combination 
of U\ and 

^(ri,r2) = 'Wi(ri,r2) + Ai^2(ri,r2) (40.12) 

where A is the variation parameter. Substitution of (40.12) into (27.6) 
gives 


TTfD\ ^ "I" -4^)J?ll + 2AH\2 

= l+A^ + 2Ay ■ ■' 

Hu = H22 = ! JuiHuidTxdT2, Hi2 


y s JJwiUssdridra 
= H 21 ^ f juiHu2dridT2 


These equalities between matrix elements are easily established with the 
help of Eq. (22.10) when it is remembered that the are real, and that 
H is Hermitian and symmetrical in the two electrons. 

The matrix elements and 7 depend on R, For any particular value of 
R, the derivative of the right side of (40. 13) with respect to A is 


2(1 - A^){Hx2 - Tffii) 

(1 + A 2 + 2Ayy 


which is zero for A = ±1, Since the right side of (40.13) is equal to 
Hu when -4 is — 00 , 0 , and + 00 ^ one of the points A = ±1 must be a 
minimum and the other a maximum. The integrals in (40.13) can be 
expressed in terms of tabulated functions, and the minimum expectation 
value of H is obtained with A = H- 1 : 


U{R) ^ 


xj/ = Ui + U2 

Hu + Hx 2 


1 + 7 


(40.14) 


The upper limit on U(R) given in (40.14) has the general appearance that 
is characteristic of the internu clear potential energy of a diatomic mole- 
cule (see Fig. 30) and is in good agreement with experiment.^ Since ^ in 
(40.14) is symmetric in an interchange of the space coordinates of the two 
electrons, it must be multiplied by the antLsymmetric singlet spin func- 
tion given as the last of Eqs. (33. G). 

It is interesting to compare t.he symmetry characters of the ground 
state of the hydrogen molecule and the excited states of the helium atom 
considered in Sec. 33, from a physical point of view. Because of the 
exclusion principle, electrons must be in dilTerent space states if they 
have parallel spins, and hence tend to keep away from each other. In the 


^ Pauling and Wilson, op. dL, Sec. 43a, 
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excited 1828 state of helium, for example, this reduces the electrostatic 
repulsion of the electrons and lowers the energy. Thus the triplet states 
of helium tend to lie lower than the singlet state of the same configuration. 
(The situation is different for the ground state, since only the singlet state 
can exist for the configuration.) In the ground state of the hydrogen 
molecule, on the other hand, the lowest energy (strongest binding) is 
obtained when the electrons tend to concentrate between the two nuclei, 
since then the repulsion between the electrons is more than compensated 
by the attraction of both nuclei for each electron. This occurs when the 
electrons can occupy the same space state and hence when they have 
antiparallel spins. Thus it is the singlet state that leads to a stable 
molecule. 

The Morse Potential. We now turn to diatomic molecules in general, 
and consider the nature of the solutions of Eq. (40.9) for the nuclear 
motion. If the nuclei have masses Mi and and their relative position 
vector R has polar coordinates R,6y4>, the equation for the relative motion 
becomes [see Eqs. (16.5)] 

- ^ + i7(ie)] w{RM) = Ew(JiM) (40.15) 

where M == AfiM 2 /(Mi + ilf 2 ) is the reduced mass. 

It has been found by experience that the potential-energy function 
for the lowest electronic states of actual diatomic molecules can be repre- 
sented quite accurately by a simple analytic function that contains three 
adjustable parameters 


U{K) = C7o e 


2(/e-.Ro) 


{R-Ro) 


- 2e 


(40.16) 


Equation (40.16) represents the Morse potentialj^ and is draAvn in Fig. 30. 
U approaches zero exponentially for large R, has the minimum value 
— ?7o at i2 = Rqj and becomes large and positive as R approaches zero if 
the “breadth^' a of the attractive region is somewhat smaller than the 
equilibrium distance Rq. 

Figure 30 has the general appearance that would be expected for a 
diatomic molecule. The zero of energy is arbitrarily chosen to be zero 
when the neutral atoms are far apart; then U becomes negative at firet 
as R decreases, because of the van der Waals attraction. ^ For smaller iZ 
this is replaced by the much stronger Heitler-London resonance attrac- 


^ P. M. Morse, Phys, Rev., 34, 67 (1929). 

® One of the inaccuracies of the Morse potential is its roplaoomont of the 1/72“ 
van der Waals term (see the end of Sec. 27), by an exponential ; howevt^r, the behavior 
of U at such large 72 has little influence on molecular energy levels. 
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tion. Ab fl coatinues to decrease, the close approach of the nuclei (or 
ionic cores) gives rise to a repulsion that causes U to increase and eventu- 
ally become large and positive.^ 

Rotation and Vibration of Diatomic Molecules. Equation (40.15) 
can be separated in spherical coordinates, as was Eq. (14.1), to give 

w(R,e,<l>) = ^ 


K and Mr are the angular-momentum quantum numbers that are 
analogous to I and m, respectively, for a single particle in a central field. 



The radial equation is 


d^X 


2MdW^ 


+ W{R)x = Ex 


(40.17) 


W{R) = U{R) + 


mi{K + 1 ) 


2MR^ 


K - 0,1,2, 


Equation (40.17) is the wave equation for the ono-dimensional motion 
of a particle of mass M in a potential W{R)^ with the boundary condition 
that X vanish at U = 0. If K is not too large, the general shape of W 
resembles tluit of U shown in Fig. 30, In this case, we are primarily 
interested in vibrations of small amplitude about the minimum. We can 
then expand W about its minimum at Ri, which is only the same as R{) 
if = 0, to give 

W(R) = TFo + ^K,{R - RiY + h{R - RiY + c(72 - RiY (40.18) 

' Unlike the true interiK^tion, the MorHO potential is finite at 7? = 0. 
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where higher order terms are neglected. If the h and c terms are also 
neglected and the domain of 22 is extended to — ^ the eigenvalues of 

(40.17) are those of a linear harmonic oscillator with an additive term 
TTo. This is a good approximation for moderate values of the rotational 
quantum number K and the vibrational quantum number v. A somewhat 
better approximation can be obtained by regarding the b and c terms in 

(40.18) as perburbations on the oscillator. Since the b term produces 
only a second-order effect (see Prob. 2, Chap. VII), whereas the c term 
appears in first order (its expectation value can be computed by matrix 
methods as in Prob. 6, Chap. VI), both make contributions to E that have 
the same order of magnitude. 

Energy Levels. The eigenvalues of (40.17) to lowest order in b and c 
are then 


E “ Wq “f- h 


(ii 




iv + i)- mv + + A] 

+ 2MKI ^ ~ 


(40.19) 


PTo, Ko, b, and c can all be expanded in powers of K(K + 1), where the 
coefficients depend on the parameters of the function U{R). If U has 
the form (40. 16), the following expressions can be obtained ; 


Ri = Ro ”1“ 


h^IC{K + l)a* 
2MRlUo 


„ _ h^KiK + 1) 


Ko = 
b = 


2Uo U^K{K + 1) a 
a* MRla^ Rb 

-Ul =1^ 

a^' ^ 12a* 


¥KKK + l)2a« 

mmiUo 

~li) 


(40.20) 


Only enough terms have been retained to give E correctly to second order 
in («; + i) and K{K + 1). 

The first of Eqs. (40.20) shows that the molecule stretches owing to 
rotation. The second equation is the equilibrium energy — Uo, plus the 
rotational energy to second order. The first-order rotational energy is 
+ l)/27o, where 7o MRl is the moment of inertia of the mole- 
cule about an axis perpendicular to the line joining the nuclei; this energy 
is the same as for a rigid rotator (see Prob. 12). The third equation 
includes the change in the stiffness due to stretching. The corrections 
for stretching in the anharmonic terms b and c can be neglected to this 
order. The second term on the right side of (40.19) can be expanded 
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with the help of the expression for ifo to give 


\May 


(f + i) 


1 - 


Zh^K{K + 1) 


L fio)] 


The last two terms in (40.19) give the second-order vibrational energy 

(- H + tV) (" + i)* 2^* 


since the constant factors cancel. 

It is apparent that the rotational and vibrational energy levels agree 
in order of magnitude with the estimates made at the beginning of this 
section. As either v or K increases, the spacing between levels becomes 
smaller than that predicted from the simple rigid rotator and harmonic 
oscillator. 

Effect of Nuclear Identity. In the event that the two nuclei of a 
diatomic molecule are identical, the wave function must be symmetric 
with respect to an interchange of their space and spin coordinates if the 
nuclei have zero or integer spin, or antisymmetric if they have half-odd- 
integer spin (see Sec. 33). The discussion of Sec. 14 shows that the par- 
ity of the nuclear wave function is determined by the angular function 
and is even or odd according as K is even or odd. An inter- 
change of the space coordinates of the two nuclei is equivalent to a change 
in sign of their relative position vector R, so that the parity determines 
the space symmetry of the wave function. We thus see that for nuclei 
of zero or integer spin, the spin function must be symmetric for even K 
and antisymmetric for odd K] for nuclei of half-odd-integer spin, the spin 
function must be antisymmetric for even K iipd symmetric for odd K, 

The discussion of Sec. 33 shows that for two nuclei of spin Ih each, 
the total of {21 + 1)^ spin states can be divided into (/ + 1)(2J + 1) 
symmetric states and I {21 + 1) antisymmetric states. Thus in a gas 
that is in statistical equilibrium, the ratio of the number of molecules 
with even K to the number with odd K will be (Z + 1)/Z if Z is zero or an 
integer, and I /{I + 1) if I is half an odd integer.^ This effect gives rise 
to alternating intensities in the band (rotational) spe(d.nim of homonu- 
clear diatomic molecules. Both the spin and the statistics of appropriate 
nuclei can be determined in this way, and the results are in agreement 
with the general statement in Roc. 33. 

41. ATOMIC NUCLEI 

The application of ([uantum mechanics to the investigation of the 
structure of atomic nuclei en trails great mai.ht^inatical complexities in all 

^ This ratio is, of c.ourHi^, modified by tho Boltzinaiui factor if the spacing between 
rotational levels is not small in comparison witlj the thermal energy kT. 
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but tke simplest cases. In this section we review briefly present-day 
ideas of nuclear structure and the nature of the formal difficulties encoun- 
tered. Following this, we consider some elementary properties of the 
nuclear two-body problem.^ 

General Properties of Nuclei. Atomic nuclei are now believed to 
consist of protons and neutrons, both of which are called nucleons. Other 
particles (mesons, electrons), which probably have at least a transient 
existence within nuclei, are usually ignored in structure theories. Pro- 
tons are nuclei of hydrogen atoms, and neutrons are particles that have 
about the same mass, no electric charge, and the same spin and statistics 
as protons (spin Fenni-Dii'ac statistics). A nucleus can be character- 
ized by its charge Ze^ where Z is an integer and e is the positive proton 
charge, and its mass M, which is measured in units of of the mass of 
0^® (oxygen isotope of mass number 16). M is always found to be close 
to an integer A, called the mass number. The number of neutrons in a 
nucleus is equal to A — Z; thus the deuteron (heavy-hydrogen nucleus) 
H® consists of one proton and one neutron, the alpha particle (helium 
nucleus) He^ of two protons and two neutrons, and the gold nucleus Au^®^ 
of 79 protons and 118 neutrons. 

According to the theory of relativity, the difference between the sum 
of the masses of the Z protons and A — Z neutrons in a nucleus and the 
mass M of that nucleus, multiplied by the square of the speed of light, is 
the energy evolved when the separate nucleons are brought together to 
form the nucleus. This energy is called the binding energy of the nucleus, 
and is conveniently measured in units of a million electron-volts (Mev). 
The mass defect M — A is an approximate measure of the binding energy,- 
and the packing fraction (M — A) /A is a correspondingly approximate 
measure of the binding energy per nucleon. The packing fraction is 
found to be nearly constant for the nuclei between oxygen and mercury, 
and is larger at both ends of the periodic system. Thus the binding 
energy per nucleon has a nearly constant value for most nuclei, and is 
about 8 Mev. 

Each nucleus has a quite well-defined radius R, which ranges from 
about 3 X 10”^® cm for the alpha particle to about 10^^^ cm for the heavi- 
est nuclei. The radius can be specified in terms of the interaction 
between the nucleus and either a neutron or a charged particle (proton, 

1 For more detailed discussions, see the papers by H. A. Bcthe, R. F. Batcher, and 
M. S. Livingston, Rev. Mod, Phys.^ 8, 82 (1936); 9, 69, 245 (1937); G. Gainow, “Stnic- 
ture of Atomic Nuclei and Nuclear Transformations” (Oxford, New York, 1937); 
H. A. Bethe, “Elementary Nuclear Theory” (Wiley, Now York, 1947); L. Rosenfold, 
“Nuclear Forces” (Interscience, New York, 1948). 

* It is only approximate, since the proton and neutron masses are soniewliat greater 
than unity and not quite equal to each other. 
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alpha particle) . In the first case, collision experiments indicate that there 
is a separation distance beyond which there is no interaction and within 
which a neutron is strongly attracted by the nucleus. In the case of a 
charged particle, collision experiments show a distance within which the 
long-range Coulomb repulsion is dominated by a strong, short-range, 
attractive interaction. Some of the heaviest elements emit alpha 
particles radioactively, and the nuclear radius can be determined from 
the relation between atomic number, alpha-particle energy, and radio- 
active decay rate. The radii obtained from considerations of all these 
kinds are in substantial agreement. The volume per nucleon turns out 
to be nearly constant for all but the lightest nuclei; this can be expressed 
by the relation R == roA*, where ro = 1.4 to 1.5 X 10“^^ cm. 

Nature of Nuclear Interactions. The constancy of the binding energy 
and volume per nucleon suggests that a nucleus is constructed somewhat 
like a drop of liquid, in which the binding energy and volume per atom (or 
molecule) are also constant. The liquid structure arises from the inter- 
atomic potential curve (see Fig. 30, page 295) ; the attractive region (near 
the minimum of the potential) produces cohesion, and the repulsion at 
short distances prevents collapse. Thus the interatomic distance (and 
hence the atomic volume) is constant ; the interaction is confined to near- 
est neighbors, so that the binding energy per atom is constant. Theo- 
retical considerations basep on the binding and collision of a pair of 
nucleons (see below) indicate that there is probably no appreciable 
short-range repulsion in their interaction, which resembles instead the 
square well potential of Fig. 13, page 76. The ciuestion then arises as to 
what it is that prevents the nucleus from collapsing into a region whose 
dimensions are roughly independent of the mass number, with a conse- 
quent increase in the binding energy per nucleon as A increases. 

It might at first be thought that the exclusion principle would prevent 
collapse, since there cannot be more than four nucleons (two protons and 
two neutrons of opposite spins) in the same space state. That this is 
not the case can be seen in the following way. The discussion of the 
Thomas-Fermi atom model in Sec. 38 shows that the maximum and aver- 
age kinetic energies of a number of independent particles that obey the 
exclusion principle are proportional to the two-thirds power of the spatial 
density. Thus for a region of given size, the total kinetic energy is pro- 
portional to ri.9. However, the total potential energy if each particle 
can interact with every other is proportional to the number of pairs of 
particles, or approximately to A'K Thus the most stable state ocguvh 
when each particle is within int(n‘action range of every other, and the 
total binding energy is roughly proportional to A® rather than to A, as 
observed. 
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It is necessary, then, to assume that the number of nucleons that can 
interact strongly with a particular one is limited in some way that is inde- 
pendent of the mass number. This is the kind of effect that occurs in 
polyatomic molecules that are held together by homopolar bonds. As 
discussed in the last section, these bonds are produced by resonance 
between unperturbed states that are degenerate with respect to electron 
exchange. Homopolar bonds are saiurated: the exchange of a pair of 
electrons between two atoms tends to preclude the formation of an 
electron-exchange bond with a third atom. On the basis of this analogy, 
the assumption is made that the interaction between a pair of nucleons is 
also of the exchange type and shows saturation. A formal treatment can 
be made without specifying what it is that is exchanged between the 
nucleons, although it appears that mesons are responsible for the inter- 
action. What is required is the inclusion of an exchange operator P 12 in the 
interaction energy V(l,2): 

7(1,2)Pi2^(1,2, . . . ) - 7(1,2)^(2,1, . . . ) (41.1) 

The P operator can act on the space or spin coordinates of the two 
particles, or both. The combination of different kinds of exchange 
operators that should be assumed, and the extent to which they can be 
replaced by interactions that couple the spins and orbital motions of a 
pair of nucleons (tensor interactions), are not settled at the present 
time.^ 

The near equality of the numbers of protons and neutrons in the 
nuclei up to neon indicates that the interaction between a pair of protons 
is about the same as that between a pair of neutrons. As A increases 
further, the ratio of number of neutrons to number of protons increases, 
up to about 1.6 for the heaviest nuclei. This can be understood in terms 
of the repulsive Coulomb interaction between protons, which does not 
saturate, and hence adds a term of order Z{Z — l)e‘'/R to the total energy 
of a nucleus. This energy is relatively small in He^ (less than 1 Mev as 
compared with a binding energy of 28 Mev), but becomes progressively 
more important as Z increases. Without this term, the most stable 
nucleus of a given mass number and radius has the same number of 
neutrons and protons, since the kinetic energy of the protons is roughly 
proportional to Z^, that of the neutrons to (A — Z)®, and the sura of 
these is a minimum when Z = iA, Thus the inclusion of a positive Z^ 
term shifts the minimum toward smaller Z. For the heaviest nuclei, 
the Coulomb term leads at first to alpha radioactivity, and eventually 

1 See for example G, Breit and E. Wigner, Phys. Rev.^ 63, 998 (1938) ; G. M. Volkoff, 
Phya. Rev., 62, 126, 134 (1942), 
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to instability with respect to fission of the nucleus into two parts of com- 
parable size. 

The approximate equality of neutron-proton and like-particle inter- 
actions is indicated by the large binding energy of the alpha particle 
(28 Mev) in comparison with that of the deuteron (2.2 Mev). The 
alpha particle has six pairs of interacting particles, and hence about tmce 
as much binding energy per pair as the deuteron. Thus the like-particle 
interactions cannot be much weaker than the neutron-proton interaction. 
Also, they cannot be much stronger or the di-proton (He^) would be 
stable, and it does not appear to exist. This qualitative conclusion is con- 
firmed by the more quantitative treatment of the two-nucleon problem 
that is outlined below. 

Nuclear Models. The mathematical difficulties that appear in the 
treatment of all but the two-nucleon systems are due to the lack of a sim- 
ple model for the nucleus that preserves its essential features. As pointed 
out in Sec. 38, the central-field approximation is useful in treating many- 
electron atoms because of the dominant role of the nuclear Coulomb 
field. In the last section, it was shown that the disparity in mass between 
electrons and nuclei makes possible an effective treatment of molecules. 
With nuclei, we have a situation in which the masses and interaction 
energies of all nucleons are about the same. Moreover, the interactions 
are strong and of short range, which means that a nucleon is more strongly 
influenced by its neighbors of the moment than it is by the average 
motion of all the other nucleons. In spite of this, some work has been 
based on a central field approximaiion, and the Hartree method (see Sec, 
38) has been applied. 

The most realistic model, at least in the case of the heavier nuclei, 
is Bohr^s liquid-drop modcL^ This exploits the analogy between nuclear 
matter and licpuds that was mentioned earlier in this section. The 
liquid-drop model was first devised to explain the large ratio of radiative 
capture cross section to scattering cross section found in the collisions of 
slow neutrons with medium and heavy nuclei. If the nucleus acts simply 
as a potential Avell for the incident neutron, as in the central-field approxi- 
mation, application of the results of Secs, 19 and 36 shows that capture 
accompanied by the emission of electromagnetic radiation (gamma ray) 
is much less probable than scattering. Bohr showed that the experi- 
mental results can be understood if the incident neutron is assumed to 
amalgamate with the nucleus and distribute its energy among many of 
the nuclear particles. It then takes so long for enough energy to con- 
centrate on a neutron to permit it to escape (this would be observed as 
neutron sc.attering), that the excited nucleus is likely to radiate away 

1 N. Bohr, Nature, 137, 344 (1936). 
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its excess energy before that happens. The liquid-drop model has since 
been used to gain a semiquantitative insight into other nuclear properties. 

Neutron-Proton System. The most fundamental problem to be 
solved in connection with nuclei is the determination of the parameters 
of the interaction energy between pairs of nucleons. This adds a degree 
of complication to the theoretical study of nuclei as compared with the 
study of atoms and molecules, since in the latter cases the main interac- 
tion is known to be given by Coulomb’s law. However, qualitative 
considerations of the kind presented earlier in this section, together with 
extensive calculations and comparisons with experiment for the systems 
that contain two, three, and four nucleons, show that the basic interaction 
is strong and of short range. ^ The precise shape of the potential does not 
seem to make much difference, and we shall assume the square well form 
(see Fig. 13, page 76): 7(0 = — Foforr < a, F(r) = Oforr > a. When 
suitable exchange operators and tensor factors are included, this form is 
adequate for aU situations except perhaps high-energy neutron-proton 
scattering. In this case, the existence of an additional weaker inter- 
action of longer range could have an appreciable effect (see the discussion 
of phase sliifts in Sec. 19). 

It was shown in Sec. 16 that there is no bound state of a particle of mass 
m in this potential unless V oa^ > 7r^A^/8m. This result can be applied at 
once to the deuteron by taking m to be the reduced mass of the neutron- 
proton system, which is very nearly half the proton mass : 

— = 1.04 X 10“^^Mev-cm^ 

If we assume that a = 2.80 X 10“^® cm, 7o must exceed 13.3 Mev in 
order for the deuteron to exist. Since the deuteron appears to have only 
one bound state, we assume that it corresponds to Z = 0. Then the 
solution of Prob. 7, Chap. IV, shows that the measured binding energy of 
2.18 Mev is obtained with Fo = 21.4 Mev. 

The scattering cross section for neutrons of very low energy on pro- 
tons can then be obtained from Eq. (19.28). If we neglect E in compari- 
son with 7 q we find that <r = 4,3 X 10“®^ cm^. The measured cross sec- 
tion for neutrons of a few electron-volts energy in hydrogen is about 
21 X 10“®^ cm®; this energy is small enough so that it can be neglected in 
comparison with Fo, and large enough so that the binding of the proton 
in the hydrogen molecule does not affect the result. An explanation of 

1 Specific arguments against the existence of a short-range repulsion, strong enough 
to account for the obaei-ved density of heavy nuclei witliout th(^ assumption of an 
exchange interaction, have recently been given by C. Parpen and li. I. S(!hiff, Phya. 

74 , 1664 ( 1948 ). 
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this discrepancy in terms of the dependence of the neutron-proton inter- 
action on the spin state was suggested in 1935 by E. Wigner (unpub- 
lished), The deuteron is known to have spin A, and so is in a triplet spin 
state. However, as discussed in Sec. 34 in connection with exchange 
collisions of electrons with hydrogen atoms, the colliding neutron and 
proton will be in a triplet state in three-fourths of the collisions, and in a 
singlet spin state in one-fourth of the collisions. Thus the discrepancy 
is removed if the singlet cross section is taken to be 71 X lO^’^^cm^. 

If we assume that a is also equal to 2.80 X 10“^® cm for the singlet 
interaction, Eq. (19.28) shows that this cross section is obtained for a 
potential depth of either 12.2 or 14.7 Mev. It is apparent that this is 
the resonance scattering case discussed in Sec. 19, and that these two 
potentials correspond to virtual and bound singlet states, respectively. 
The decision as to which is correct cannot be made on the basis of the 
dependence of the scattering cross section on incident neutron energy; it 
is shown in Sec. 19 that with Z = 0, cr is a monotonic decreasing function 
of E in both cases, and there is not enough difference in the behavior of 
the two functions. It is found from other considerations^ that the singlet 
state is virtual, so that the depth corresponding to this range is 12.2 Mev. 

Exchange Operators. The spin dependence of the neutron-proton 
interaction found in connection with low-energy scattering can be 
expressed in terms of a spin-exchange operator [see Eq. (41.1)]. Such an 
operator multiplies a triplet (symmetric) spin function by +1 and a 
singlet (antisymmetric) spin function by —1. It follows from the fore- 
going discussion that about 20 per cent of the total triplet interaction is of 
the spin-exchange type. 

A space-exchange operator^ multiplies Avave functions of even Z by + 1 
and wave functions of odd Z by — 1 ; it has no effect on the results obtained 
thus far, which are all for Z = 0. For higher energy scattering, the dis- 
cussion of Sc(^ 10 shows that the partial wave with I = 1 can be sig- 
nificant. If the phase shift 5i is small in magnitude and the higher phase 
shifts can be neglected, Ecp (10.32) can be approximated as 

cr{B) ^ ^ (sin^ 5o + 35i sin 25o cos 6) 

For energies high enough so that 5i is appreciable, 5o is likely to be 
between 0 and 90°, so that the sign of the angular asymmetry is deter- 
mined by the sign of 5i. 

^ Most roliably from tho scattorinR of very slow nniitrona in ortlio- and para- 
hydrogen; aeo J. Sehwingor and K. Teller, J^hya, Itiw., 52, 28C (1937). 

* This waa introducicd ])y K. Majorana, Zailft. J. Phynik, 82, 137 (1933), as a modifi- 
cation of the original Huggestion of W. Heisenberg, ZnU.J. Phynik, 77, 1 (1932). 
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If now the interaction is predominantly of the space-exchange type, 
the potential energy is positive (repulsive) for I = 1, and 5i is negative. 
Then neutrons incident on protons are preferentially scattered backward 
in the center-of-mass coordinate system, or at right angles in the labora- 
tory system. The protons tend to recoil in the forward direction in both 
systems. This effect can be regarded physically as a normal (non- 
exchange) scattering, in which the neutrons are preferentially scattered 
in the forward direction, together with an exchange of identity between 
the neutron and the proton. 

Proton-proton Scattering. The scattering of fast protons in hydrogen 
can be treated by the methods outlined in Sec. 20, with suitable allow- 
ance for the identity and spin of the two colliding protons (see Sec. 33). 
When the interaction is a pure Coulomb field, combination of Eqs. 
{20.10) and (33.2) yields the MoU scattering formula^ 

cr(^) = ^2^) I 

— cosec® ^8 sec® ^6 cos In (tan® ^0) j | (41.2) 

which is expressed in the center-of-mass coordinate system. This for- 
mula represents only the experiments for protons of less than about 0.2 
Mev bombarding energy, since for higher energies the protons approach 
closely enough so that the specifically nuclear interaction becomes 
appreciable. Up to several Mev energy, only the 5n term in (20.24) need 
be included. It must be remembered that because of the exclusion 
principle, the partial wave with Z = 0 is associated with a singlet spin 
state. Thus experiments with fast protons are required if information 
concerning the triplet proton-proton interaction is to be obtained. The 
indication at present is that the singlet proton-proton interaction agrees 
in range and sign with the singlet neutron-proton interaction discussed 
earlier in this section, and is slightly smaller in magnitude.^ 

Problems 

1. Find an expression for the electron density n{r) in the Thoinas-Fermi model 
in terms of the dimensionless function x, and show that the nuUiia of a sphere that 
encloses a fixed fraction of all the electrons is proportional to Z~^. 

2. Use Lagrange’s method of undetermined multipliers to show that the condition 
that the integral in (38.9) is stationary, when the are varied but kept normalized, 
is given by (38.11). 

3. Two p electrons (Z 1) can have L = 0, 1, or 2 and S “ 0 or 1, in the Russell- 
Saimders case. Are all combinations of L and S permitted if the n values of the two 
electrons are different? Are they all permitted if the n values are the same? 

^ N. F. Mott, Ptoc. Roy. Soc.j A126, 269 (1930). m is half the proton moss, and 
V is the relative speed. 

* G. Breit, H. M. Thaxton, and L. Eisenbud, Phys. Rev.j 56, 1018 (1939). 
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4. Show that spin wave functions in the Russell-Saunders case that are eigen- 
functions of S* with different values of the total-spin quantum number S are orthog- 
onal. Use a method like that which was employed in Sec. 10 to show that energy 
eigenfunctions corresponding to different eigenvalues are orthogonal. 

5. Show by direct computation that J «= L + S commutes with L*S, and hence 
with the Hamiltonian (39.1). 

6. Use the M matrices given in Eqs. (24.16) to show that the wave functions 
(39.6) have the indicated eigenvalues of and Jg. 

7- Estimate the ratio of the term in (39.9) that is linear in H to the kinetic-energy 
term, for a hydrogen atom in a magnetic hold of 10® gauss. 

8 . Estimate the magnitude of the magnetic field strength for which the two 
perturbation terms in (39.11) are equal for an alkali atom. 

0. CJonatruct a diagram that sliows the relative displacements and intensities of 
the IT and <t Zeeman components of the — >• ^S transitions in an alkali atom when the 
magnetic field is weak. Construct a similar diagram when the field is strong. 

10. Show that the ratio of the distance traveled by the nuclei of a molecule during 
a period of the electronic motion, to the dimensions of the molecule, is of order {m/M)^ 
in the case of vibrational motion, and (m/Jlf ) in the case of rotational motion. Do 
these results justify an adiabatic type of approximation? 

11 . Does the intern ucleor potential (40.14) obtained for the hydrogen molecule 
approach the — l/E® form obtained in Sec. 27 for large if? If not, why doesn^t it? 

12. Set up and solve the wave equation for a rigid rotator: an object that has no 
kinetic energy of rotation about a particular axis, and equal moments of inertia about 
the two perpendicular axes. 

13. Derive the selection rules for transitions between rotational levels of a diatomic 
molecule. 

14. Discuss the selection rules for vibrational transitions when the two nuclei of a 
diatomic molecule are the saiiic and when they are dilferent. 

16. Calculate the contributions to the vibrational energy of a diatomic molecule of 
the neglected fifth- and sixth-power terms in the expansion (40.18), and show that 
their neglect is justified in arriving at the (o i)“ t^^iK^rgy given in (40.21), Use matrix 
methods to get the muided matrix olenient.H of and for a harmonic- oHc-illator. 

16. Calcuhito the niinimuiu kinetic, (mergy in Mev of a nucleon that moves iiL a 
rigid-walled sphere whosc^ radius li (‘.orresponds to a uranium nuclous. 

17. Suppose that the interaction between a neutron and a proton is the same in 
the singlet os in the triplet state, and is rei)rcsented by a sqiiare well. Is there any 
value of a that will lit both th(^ d(ait(n'on binding energy (i = 0) and the slow-nentron 
scattering cross section? If so, what is it? 

18. Make use of the work of Bc^c. 20 to obtain an expression for the ratio of the 
proton-proton scattering with a phase si lift 5n (and no others), to the Mott scattering 
given by Eq. (41.2). 

19. The ground-state wave function V'o of an atom or nucloiis, if nondegeucrato, 
has a definite value ./o for its total-anguhir-momcntum quantum number. Since the 
diagonal matrix chunent of ati operator U for a static V' vanishes unless is spheri- 
cally symmetric, show that the expectation value of S2 for the state is zero unless 
the function when exj)anfle(l as a series of eigenbmctioiis of the operator J*, 
contains a term with quanUim number Jo. Tf U is the operator for a 2^ electric or 
magnetic multipolc monuMit, show also that its expectation value is zero unless Jo ^ 
(compare with the discussion at the end of Sec. 25), 



CHAPTER XII 

RELATIVISTIC WAVE EQUATIONS 


In this chapter we extend the nonrelativistic Schrodinger wave 
equation to the description of the motion of a particle that has a speed 
approaching that of light. This extension can be made in many Avays, 
each of which is consistent with the Lorentz transformation equations of 
the special theory of relativity.^ A characteristic feature of relativistic 
wave equations is that the spin of the particle is built into the theory from 
the beginning, and cannot be added afterward as Pauli added the electron 
spin to Schrodinger's nonrelativistic equation. This feature provides a 
useful gauge of the applicability of a particular equation to the description 
of a particular kind of particle. Two relativistic equations are con- 
sidered here: the scalar equation due to Schrodinger that describes a 
particle without spin, and the more complicated equation due to Dirac 
that describes an electron. In discussing these equations, we devote 
our attention mainly to the deductions that can be made from them and 
do not attempt to establish their Lorentz invariance. We shall, there- 
fore, continue to use three-dimensional vector notation rather than the 
more elegant four-dimensional notation of special relativity theory. 
The invariance of an equation can usually be inferred quite convincingly 
from its symmetry betAveen the space coordinates and the time. 

42. SCHRODINGER’S RELAXmSTIC EQUATION 
At the time when Schrodinger developed his nonrelativisitic Avave 
equation, he also proposed an extension of it that meets the requirements 
of special relativity. ^ This equation follows quite naturally from the 
transition in classical dynamics from the nonrelativistic relation 



( 42 . 1 ) 


between the energy and momentum of a free particle, to the corresponding 
relativistic relation 




( 42 . 2 ) 


1 For a review of special relativity, see, for example, P. O. B(^rgmiinn, ‘‘Introduc- 
tion to the Theory of Relativity,” Part I (Prentice-Hall, New York, 1046), or R. C. 
Tolman, “Relativity, Thermodynamics and Cosmology,” Chaps. II, III, and IV 
(Oxford, New York, 1934). 

* E. Schradinger, Ann. d. Phyaikj 81, 109 (1926), Sec. 6. 
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where now E includes the rest-mass energy mc^. We proceed by adopting 
the substitutions (6.13) for E and p 

E ih p —ih grad (42.3) 

Free Particle. A relativistic wave equation for a free particle can be 
obtained by substituting (42.3) into (42.2) and operating on a wave 
function |/'(r,i), just as the substitution of (42.3) into (42.1) yields Eq. 
(6.11). The result is 

(42.4) 

at 

Equation (42.4) has plane wave solutions of the form 

exp ^(k • r — ojO (42.5) 

which are eigenfunctions of the operatom E and p in (42.3) with eigen- 
values ho3 and /Ik, respectively. It is apparent that (42.6) satisfies Eq 
(42.4) if 

ho3 = ± + m^c^)4 (42.6) 

The positive and negative square roots in (42.6) correspond to an ambigu- 
ity in the sign of the energy that also results from the classical expression 
(42.2). We take only the positive scpuire root for the present, and 
return to the negaUvo-cnergy solutions at the end of Sec, 44. 

Expressions for the charge and current densities (^an be found in 
analogy with those obtained in Sec. 7. The conservation equation 

|p(r,0 + divS(r,0 = 0 (42.7) 


turns out to be invariant with respect to Lorentz transformations. We 
multiply (42.4) on the left by the complex conjugate equation on the 
left by and subt.ract one from the other. Then (42.7) results, if we 
define real quantises 


P{x,f) = - vt— ^ 

^ ' ’ 2mc“ V ^ / 

S(r,/) = ^ grad yf/ - ^ grad if) 


(42.8) 


This expression for S is identical with the nonrelativistic form (7.3), 
and the expression for P can be shown to reduce to (7.1) in the non- 
relativistic limit (see Prob. 2). It should be noted that P given by (42.8) 
is not necessarily positive, and hence cannot be interpreted as a position 
probability density. It can, however, be multiplied by e and interpreted 
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as an electric charge density, since charge density can have either sign so 
long as it is real. 

Electromagnetic Potentials, We can include the electromagnetic 
potentials A(r,0, in the wave equation by making use of the fact 
that <t> and (l/c)A have the same Lorentz-transformation properties as 
E and p. In analogy with the nonrelativistic expression (23.14), we 
replace (42.2) by 

{E — 60)2 = (cp — 6A)2 + (42.9) 

for a particle of charge e. The substitutions (42.3) then give 

{-»’S 

= [—^ 2^272 _|_ 2iehcA • grad + iehc{div A) + e^A^ + (42.10) 


We can now find the connection between Eq. (42.10) and the similar 
Eq. (23.24) in the nonrelativistic limit. We make the substitution 

imoH 

0(r,O = 0'(r,i)e" ^ (42.11) 

in (42.10), and assume that operation with ih(d/dt) on 0' gives a result 
that is of the same order as and small in comparison with mcV'- 
This is equivalent to subtracting out the rest energy and assuming that 
the remaining energies are small in comparison with it. Differentiation 
of (42.11) gives 


dt \dt 

av ^ /ay 

\ dt^ 



2imc^ d\p' 
~ h 


imcH 

li 





e 


irncH 

ft 


The first term in each of these derivatives can be neglected, as can the last 
two terms on the left side of (42.10), which then becomes 


With these approximations, Eq. (42.10) becomes the same as (23,24) 
if 0 ' is replaced by 0 . 

There is no way in which the Pauli spin matrices (33.3) can be included 
in Eq. (42.10) without destroying the invariance of the theory. Phis is 
not surprising, since the spin matrices transform like the (^onlponents of 
three-dimensional, rather than a four-dimensional, vector, and since 
has one component rather than two components like the spin functions 
(33.4). Thus the Schrodinger relativistic equation represents a particle 
that has no spin. 
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The structure of Eq. (42.9) shows that a “potential-energy'’ term 
cannot be added arbitrarily to (42.10), as the term was added to 
(23.24) to give Eq. (36.1). The Lorentz-transformation properties of 
any such term must be investigated first. If it transforms like part of a 
four-dimensional vector, the rest of this vector must be included in some 
such manner as 0 and (l/c)A were included in (42.9). If it is an invariant 
with respect to Lorentz transformations, it can be included as part of the 
rest energy 

Separation of the Equation. The wave equation (42.10) can be 
separated with respect to r and t if the potentials A,<#) are independent of 
the time. We then put 

Hu) = 'u(r)e » 

and substitute into (42.10) to obtain 
(E - e(l>yu 

= + 2iehcA > grad + iehc(div A) + eW + m^c^]u (42.12) 

We now specialize to the case in which A = 0 and is spherically 
symmetric. Equation (42.12) then becomes 

+ m^c^)u(T) =[E - e<t>(T)]^uit) (42.13) 

which can be separated in spherical coordinates (see Sec. 14) to give 
u(r,9,<l>) = R(r)Yi„,{d,<l>) 


I = 0 , 1 , 2 , . . . 

This reduces to the nonrelativistic radial equation if we put E = mc^ -|- E', 
and assume that £7' and e(j) can be neglected in comparison with me®. 
Then the bracket on the right side of (42.14) becomes (2m/ft®) {E' — e<l>), 
as it should. 

Energy Levels in a Coulomb Field. An exact solution of (42.14) can 
easily be obtained if we put e<l) = —Zc^lr^ by making use of the results of 
Sec. 16. This situation would represent a hydrogen atom were it not for 
the fact that the particle described by (42.10) has no spin, and so cannot 
be an electron. 

If we put p = aVj Eq. (42.14) can be written as 


(E — e<f>)® ■ 


(42.14) 


r 1 d( d\ ;a + i) ip_ r 

dr V drj ^ r“ J 


1 d f ,dR\,{\ 

7dpy jfj + Vp 

Z(^ 



— E^) ^ 2Ey 

h'^c^ ' hca 


CL 


(42.16) 
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This has precisely the form of Eq. (16.7) except that l{l + 1) has been 
replaced by l{l + 1) 7 ®. The parameter X is determined by the bound- 

ary condition on R when p = oo, and E is expressed in terms of X by 
eliminating a from the last two of Eqs. (42.15): 

E = mc^ ^1 

A study of the way in which Eq. (16.7) was solved shows that solu- 
tions of (42.15) that are finite at p = 0 and oo exist only if 

X = n' + $ + 1 (42.17) 

where n' is zero or a positive integer and s is the nonnegative solution of 
the equation 

s{s + 1) = l{l + 1) - 7 ® (42.18) 

Equation (42,18) has the two solutions 

S= - i[(2Z + ly - 47^]* (42.19) 

of which one is positive and the other negative for ? > 0. For Z = 0, 
both the s values given by (42.19) are negative; however, y is quite small 
(very nearly equal to Zj 1S7 if e is the electronic charge), so that use of 
the upper sign in (42.19) gives a value of s that is close to zero for physi- 
cally interesting values of Z. Moreover, even though R{r) behaves like 
r® near r = 0, and so is singular at the origin, the integral of P(x) given 
by (42.8) converges there so that the total electric charge is finite. Wo 
thus use the upper sign in (42.19) for all Z, and obtain from (42.17) 

X = n' + i + [(Z + i)2 - 7 ^]* (42.20) 

Equations (42.16) and (42.20) give a fine structure to the nonrela- 
tivistic energy levels (16.15). This can be seen by expanding the expres- 
sion for the energy levels in powers of 7 ^. The result to terms of order 
7 ^ is 

where n = w' -f Z + 1 is the total quantum number of Eq. (16.14), and 
can take on positive integer values. The first term on the right side of 
(42.21) is the rest energy. The second term is 

mc^y^ _ mZ^e^ 


+ ( 42 . 16 ) 


and agrees with (16.15). The third term is the fine-structure energy, 
which removes the degeneracy between states of the same n and different 
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1. The total spread of the fine-structure levels for a given n is easily 
seen from (42.21) to be 


n — 1 
n — \ 


(42.22) 


This is much larger than is observed experimentally in the spectrum of 
hydrogen. 


48. DIRAC’S RELATIVISTIC EQUATION 
Dirac ^ approached the problem of finding a relativistic wave equation 
by starting from the Hamiltonian form (23.1) 

I = HHU) (43.1) 

The classical relativistic Hamiltonian for a free particle is the positive 
square root of the right side of Eq. (42.2). However, if this is substituted 
into (43.1) and p is replaced by —ih grad, the resulting wave equation is 
unsymmetrical with respect to space and time derivatives, and hence not 
relativistic. Dirac therefore modified the Hamiltonian in such a way 
as to make it linear in the space derivatives. 

Free-particle Equation. The simplest Hamiltonian that is linear in 
the momentum and mass terms is 

H = — ca ■ p — (43.2) 

Substitution into (43.1) leads to the wave eciuation 

(E + CQc * -p + = 0 

or (43.3) 

— ihc ja ■ grad + = 0 

We now consider the four quantities a*, a*, and /3. Tf (43.3) is to 

describe a free particle, there can be no terms in the Hamiltonian that 
depend on the space coordinates or the time, since such terms would 
have the properties of spacc-time-dependcnt energies and give rise to 
forces. Also, the space and time derivatives are to appear only in p and 
E, and not in a and /3, since (43.3) is to be linear in all these derivatives. 
We thus conclude that a and ^ are independent of r, t, p, and jB, and hence 
commute with all of them. This does not necessarily mean that a and /3 
are numbers, since they need not commute with each other. 

^ P. A. M. Dirac, Proc. Roy. Sor., A117, 610 (1028); “Tho Principles oP Qiianlnni 
iMochanics,’^ 2d od., Chap. XTl (Oxford, Now York, 
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We can leam more about a and p by requiring that any solution ^ of 
(43.3) also be a solution of SchrSdinger's relativistic equation (42.4) 
(the converse need not be true). This is a reasonable requirement since 
in the absence of external fields, the wave-packet solutions of (43.3) 
whose motions resemble those of classical particles must have the classical 
relation (42.2) between energy, momentum, and mass (see Prob. 1). 
We therefore multiply Eq. (43.3) on the left by (i? — ca * p — fimc^) to 
obtain 

{E^ - + (4vl + + (a*% + 0Lya^)VxVy 

+ (oya, + 0La0Ly)Vy'Pz + (a«a» + olxOL2)'PzV:A ““ 

772C®[(Q!a^ "I” ^OLx)Pa H” ^^)Py H” “f“ ” 0 (^^3.4) 

where the substitutions (42.3) for E and p in terms of diiTerential opera- 
tors are implied. Equation (43.4) agrees with (42.4) if a,/3 satisfy the 
relations 

aj = aj = aj = /3» = 1 

OL^aOty + Oiyaa = CLyag + agOLy = agOLx + OLxCtg^ 0 (43.6) 

otafi + = OLyP + ^cty = -f- = 0 

The four quantities are said to anticommute in pairs, and tlunr scpiares are 
unity. 

Since a, ^ anticommute rather than commute with cacli oilier, they 
cannot be numbers. We have seen in Chap. VI that quantities of this 
type can be expressed in terms of matrices, and it is convenient for calcula- 
tion to have a matrix representation of them. We note first that since 
the H giy^ by (43.2) is Hermitian, each of the four matrices must 
be Hermitian, and hence square. Our problem is to find an explicit repre- 
sentation in which, say, one of these matrices is diagonal (then the others 
cannot be diagonal since they do not commute with this one). In the 
interests of simplicity, we shall require the representation to have as low a 
rank as possible. 

Matrices for a and /?, The squares of all four matrices are unity, so 
that their eigenvalues are +1 and -1. We arbitrarily clioose p as the 
matm that is to be diagonal, and rearrange its rows and crohimns ho that 
all the +1 eigenvalues are grouped together in a matrix of rank n, and all 
the 1 eigenvalues are grouped in a matrix of rank m. Since anti- 
commutes with a, it cannot be a constant matrix, and so both n and m 
m^ e greater than zero. The matrix can be represented schemati- 

I iTT n A ^ 
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10.. 
0 1.. 

■ • o o 

. . o O 

> 

o 

o 

-10. . 

0 0.. 

1 

o ■ 

. . . . 

. . . J 


(43.7) 


The dashed lines in (43.7) separate the submatrioes 1, 0, 0, and —1 that 
appear in (43.6).^ We now consider the matrix equation +■ = 0, 

the jl element of which is 

+ Pi) = 0 


Here jSy and pi are two of the eigenvalues of P, which are arranged in 
accordance with (43.6) or (43.7). If Pj = pi^ then (o:*)/* =" ^9 whereas if 
pj and Pi have opposite signs, need not be zero. Thus the matrix for 
a» can be written in the form 



where a^i has n rows and m columns, and ax 2 has m rows and n columns. 
Since the square of (43.8) is a unit matrix, we see also that 

OlxlOlx 2 = 1, Clz 20 isl = 1 (43.9) 

The unit matrix that appears on the right side of the first of Eqa. (43.9) 
has n rows and columns, and the unit matrix in the second equation has 
m rows and columns. It is not difficult to show that no matrices can 
be found that satisfy (43.9) if equal 1,2 or 2,1. We therefore con- 
sider the two possibilities n = m = 1 and n = m = 2, as giving matrices 
of minimum rank. It is apparent that ay and a* can be written in forms 
similar to (43.8). 

We have already obtained three anticommuting matrices of the form 
(43.6) or (43.8) with n = m = 1, These are the Pauli spin matrices 
(33.3) 

= = = (43.10) 

which satisfy the equations 

ffiCy ” ^ to" g (43.11) 


1 The matrices 1, 0 are the same as the unit and null matrices 1, O defined in 
Sec. 21. 
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together with the two similar relations obtained by permuting x, z* Any 
matrix with two rows and columns has four elements, and so can be 
expressed as a linear combination of the four linearly independent 
matrices o-*, o-y, o-,, and 1. Then it is not difficult to show that a fourth 
matrix that anticommutes with all three of those in (43.10) cannot be 
found. 

We therefore try n = m = 2, and for simplicity take == 
etc. Then Eqs. (43.9) become aJi = 1, and the equation + otyOdB, = 0 

becomes + oLyioi.»i = 0. From these and the similar relations 

obtained by permuting we see at once that we can identify otxi with 
o-flj, etc. We thus arrive at an explicit matrix representation for : 



(43.12) 


These matrices are evidently Hermitian; we abbreviate them as 


9-(j -?) «-(° S) <«.13) 

where each “element” is a matrix with two rows and columns.^ 

Free-particle Solutions. Now that a and ^ are represented by 
matrices, Eq. (43.3) has no meaning unless the wave function ^ is itself 
a matrix with four rows and one column: 




I ’AsCM) I 
\ I 


(43.14) 


Then (43.3) is equivalent to four simultaneous first-order partial dif- 
ferential equations that are linear and homogeneous in the four y/z’s. 
Plane wave solutions of the form 


= Uj exp ?(k • r - ut), j = 1,2, 3, 4 (43.16) 

^ The relativistic character of the Dirac equation can be made more apparent if 
(43.3) is multiplied through on the left by this enhances the symmetry between 
space and time derivatives, since the four matrices jS, have similar properties to the 
four matrices /3, a. 
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can now be found, where the Uj are numbers. These are eigenfunctions 
of the energy and momentum operators (42,3) with eigenvalues ftco and 
ftk, respectively. Substitution of (43.15) and (43.12) into (43.3) gives 
a set of algebraic equations for the t/f, where E ^ hca and p = are now 
numbers, 

{E + mc^)ui + cptU^ + c(pz — ipj/)w4 = 0 

(E + mC^)U2 + C{pa: + ^Pv)^^3 — Cp^Ui = 0 , ^ . 

(E — mc^)u3 + cpzUi + c{px — iPv)y‘2 = 0 ^ ^ 

{E — mc^)u\ -f c(p® + ipy)Ui — CPeU2 = 0 


These equations are homogeneous in the and have solutions only if 
the determinant of the coefficients is zero. This determinant is 


(E2 - 


so that the relation between E and p is in agreement with (42.2). 

Explicit solutions can be obtained for any momentum p by choosing a 
sign for the energy, say Then there are two 

linearly independent solutions, which are conveniently written as 




cp, 

E+ + 



U2 = 


C(px + iPy) 
E+ + mc^ 


) 


U2 


CPz 

J5+ + 


U3 == 1, U4. — 0 

(43.17) 

W3 == 0, U4 — 1 


Similarly, if we choose the negative square root — — (c^® + 
we obtain two new solutions, which are conveniently written as 


ui = 1, U 2 = 0, 


_ CPz 

W3 = ’ I 

— + mc^ 


ui = 0, U 2 = 1, 


U 2 


Cjpx - iPu) 
-E^ + mc^' 




c{px + iPv) 

— £?- + mc^ 

(43.18) 


it4 = 


CPz 

—E^ + mc^ 


Each of these four solutions can be normalized, in the sense that ^ V = 1, 
by multiplying it by the factor jl + [c^'^/{E^. + mc^)-])"*; is the 
Hermitian adjoint of i/', and is a matrix with one row and four columns. 

It is apparent that the solutions (43.17) correspond to positive energy, 
and the solutions (43.18) to negative energy. In the nonrelativistic 
limit, in which is close to mc^ and large in comparison with 

cjpl^ Ui and U 2 are of order v/c times or for the positive-energy solu- 
tions (v is the speed of the particle) ; the opposite is true for the negative- 
energy solutions. The physical distinction between the two solutions 
for each sign of the energy can be seen by defining three new spin matrices 
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o'm «tf» <’■» that have four rows and columns 

-'■(S 0 

We shall see at the begiimiiig of Sec. 44 that ^hd' can be interpreted as 
the operator that represents spin angular momentum. When the small 
components of the wave function can be neglected, it is easy to see that 
^ is an eigenfunction of <r'^ with eigenvalue +1 for the fimt of each pair of 
solutions (43.17) and (43.18), and eigenvalue —1 for the second solution 
of each pair. 

Charge and Current Densities. We can obtain a conservation equa- 
tion by multiplying the second of Eqs. (43.3) on the left by the lier- 
mitian adjoint equation 


— + ihc{grB.d^p*) • a + = 0 

at 


on the right by V', and taking the difference of the two results, 
get Eq. (42.7) if we define the real quantities 

P(r,0 = 

S(r,i) = — c^*a^ 


We then 


(43.20) 


The expression for P has the nonrelativistic form (7.1); since it is never 
negative, it can be interpreted as a position probability density. It can 
be shown that the expression (43.20) for S reduces to (7.3) in the non- 
relativistic limit (see Prob. 6). 

The operator —col can be interpreted more directly as a particle 
velocity by calculating the time derivative of the position vector r from 
Eq. (23.2), With the Hamiltonian (43.2) and the commutation relations 
(23.16), we obtain 

^ 1 - a) - (43.21) 


Thus the eigenvalues of a velocity component are ±c. This result can 
be made plausible from a physical point of view with the help of the 
uncertainty relation (3.1). A very precise measurement of instantaneous 
velocity [which according to (43.21) is not the same as momentum in a 
relativistic theory] requires the accurate measurement of the position of 
the particle at two slightly different times. Such accurate position 
measurements imply that the momentum of the particle is completely 
unknown, so that all momentum values are about equally probable. 
Then very large momenta are much more likely to result than small 
momenta, and these correspond to velocity components close to the 
speed of light. 
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Electromagnetic Potentials. Terms tluit involve the electromagnetic 
potentials can be added to Eq. (43.3) in a relativistic way by making the 
usual replacements cp — > cp — eA and jE — > 5 where the particle 
described by the equation has electric charge e. We thus obtain 

— 60 + a • (cp — eA) + pinc^]\l/ =» 0 (43.22) 

Here E and p stand for the operators (42.3). This equation can be 
reduced to a form that is similar to (42.10) by multiplying it on the left 
with [J5 — 60 — a • (cp — eA) — The result is 

{{E — 60)^ — [a • (cp — cA)]^ — 

+ (i/ — e0)a • (cp — cA) — a • (cp “ eA){E — e0)}0 =» 0 (43.23) 

The second operator in (43.23) can be reduced by making use of the 
following relation: 

(a • B)(a • C) = B • C + id' • (B X C) (43.24) 

where B and C commute with a but not necessarily with each other (see 
Prob. 7). We identify both B and C with (cp — eA) ; we also require the 
relation 

(cp — eA) X (cp — eA) =» — ce(A X p + p X A) « ie^curl A = iehcH 

where use has been made of (23.16). With this substitution, Eq. 
(43.24) becomes 

[a • (cp — eA)]2 = (cp — eA)® — ehc6' ■ H 


The last two operators in (43.23) can be rewritten as 


— ea • {EA — AE) — cea • (0p — p0) 

— —ieha • 


dA 

dt 


— iehca • grad 0 = ieftca - E 


where use has again been made of (23.15). Equation (43.23) then 
becomes 

[{E — e0)® — (cp — cA)® — m®c^ + ehcd' • H + iehcct • E]0 = 0 (43.26) 

The first three terms are precisely the same as (42.10). The physical 
significance of the last two terms will now be shown from consideration of 
the nonrelativistic limit of the entire equation. 

We could proceed just as we did in obtaining the nonrelativistic limit 
of Eq. (42.10). A slightly different approach consists in putting 


E = jE 7' "b 771C® 


(43.26) 
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and assuming that E' and ecl> are small in comparison with mc^; this is 
equivalent to the substitution (42.11) and the subsequent approxima- 
tions. We can then make the replacement 

(E — e4>y — = 2mc^{E' — e4>) 

in (43.25) to obtain 



Now E' is equivalent to the time derivative operator ih{d/dt) if a factor 

xmcH 

e ^ is taken out of Thus (43.27) is the nonrelativistic Schrodinger 
equation (23.24), with two additional terms that involve H and E directly. 

The H term has the form associated with the energy of a magnetic 
dipole of moment (e?i/2mc)d'. Now it was shown in (43.18), for a free 
particle, that the third and fourth components of the wave function are 
large in- comparison with the first and second components for the positive- 
energy solutions in the nonrelativistic limit. This can also be shown 
without difiBculty, when the particle is not free, from the structure of the 
general equation (43.22). Equation (43.19) shows that d'' operating on 
the four-component wave function is the same as d operating on the large 
components alone. Thus the two large components of (43.27) with the 
H term give just the nonrelativistic equation with the Pauli spin matrices 
and the correct coefficient for the magnetic moment of an electron [see 
Eq. (39.10)]. 

We now show that in practical cases, the E term in (43.27) is of order 
{v/cY times the g4> term, and so is to be neglected in the nonrelativistic 
limit. We note first that the expectation value of a is of order 
since (43.13) shows that a mixes the large and small com- 
ponents (we also saw earlier that —col is the velocity operator). For an 
electron that is part of a system of linear dimensions a, is of order 
eEa, and ft/a ^ p ^ mv. Thus the ratio of the E to the terms in 
(43.27) is of order 

eftvE 1 

c^ 

In contrast with this, we have already seen in Eqs. (39.9) and (39.10) that 
the H term in (43.27) is of the same order as the other magnetic terms 
that are linear in A. While the E term must be omitted from (43.27) 
for a consistent nonrelativistic approximation, it cannot be dropped from 
the relativistic equation (43.26), where it is required to preserve Lorentz 
invariance. 
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44. DIRAC’S EQUATION FOR A CENTRAL FIELD 

In the last section, the existence of the magnetic moment of an elec- 
tron was demonstrated explicitly by showing that the expected extra 
magnetic energy appears in the nonrelativistic approximation. The 
electron spin carries no energy in itself, and so can be observed only 
through its coupling with the orbital motion of the electron. In the 
first part of this section, we make this coupling apparent in two ways: 
through conservation of total angular momentum, and through the spin- 
orbit energy that was introduced in Sec. 38. In both cases we work with 
such potentials A,<^> that there is no transfer of angular momentum to the 
electron; this implies that we have a central field (A = 0 and <#> spheri- 
cally symmetric) . In the latter part of this section, we separate the Dirac 
equation for a general central field, and find the energy levels of the 
hydrogen atom. 

Spin Angular Momentum. With A(rj^) = 0 and <#>(r,i) = <l>{r), Eq. 
(43.22) can be written as 

H = — ca ■ p — + V 

where V = ecj). We might expect that the orbital angular momentum. 
L — r X p is a constant of the motion in such a central field. In order 
to investigate this point, we calculate the time rate of change of L with 
the help of Eqs. (23.2) and (23.16): 

ih ^ = LJI - HL, 

= -6'« • [ivp, - zpv)p - p(l/Pz - ZPv)] 

= ihc{oigj[)jj ciyPz) (44.2) 

since L commutes with any spherically symmetric function such as V{r), 
It is apparent that L docs not commute with //, and hence is not a con- 
stant of the motion. However, wo expect on physic, al groundvS that it is 
possible to define a total angular momentum that is constant in a central 
field of force. This means that we must find another operator such that 
the commutator of its x (jomponeiit with H is the negative of the right 
side of 44,2); the sum of this operator and L is then a constant of the 
motion and c.an \h\ int(M*prele(l as the total angular momentum. 

It is not difficult to see that th(^ desired operator is a multiple of the 
d' defined in (43.19). From (43.11) and (43.13), we find that cr'^ com- 
mutes with dx and although not with the other components of a: 
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_ /tr« 0 \ /O (ry\ ^ fO o- A /<Tx 0\ 

\0 (Tx) v^y 0 / \cry 0 / \0 o**/ 

’{L r)-Ut-)-“- 


The time rate of change of 6' can now be obtained: 

ih^ = yjl - He’, = -2ic(«.p„ - a^p,) 


(44.3) 


It is apparent from (44.2) and (44.3) that the quantity L + ^hd' com- 
mutes with H and can therefore be taken to be the total angular momen- 
tum. We refer to the operator 

(44.4) 


as the spin angular momentum of the electron. 

Approximate Reduction; Spin-orbit Energy. We now wish to show 
that the spin-orbit energy (38.13) is a consequence of the Dirac equation. 
This term can be shown to be of order (v/c)^ times the potential energy: 


1 1 ON 1 1 ^ 

V 2m^c^ r dr ^ ^ y' 




— 


where a represents the linear dimensions of the system, and 

h 

- /^ p mv, 
a 

Thus the approximations that led to (43.27) are not adequate for the 
present purpose. 

In order to obtain a consistent approximation that is expressed in 
terms of the more familiar two-component wave functions, we replace 
^ in (44.1) by and ^ 2 , which now represent respectively the first two 
and the last two components of We assume that ^ 1,^2 together con- 
stitute a nonrelativistic energy eigenfunction, which means that 

E — mc^ 

is regarded as a number rather than an operator; E^ and V are assumed to 
be small in comparison with mc^. The wave equation then becomes 

{E' + 2mc^ - 7)^1 + cd • p ^2 = 0 , gN 

(E' - 7)^2 + cd • p^i = 0 ^ ^ 

where p is still an operator. It is apparent from the first of these equa- 
tions that is of order v/c times ^ 2 , so we eliminate it to obtain an equa- 
tion in terms of ^2 alone. The substitution 

i/'i = — {E' + 2mc® — 7)'^cd • pt ^2 
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in the second of Eqs. (44.5) gives 


Thus far, no approximations have been made. 

The desired approximation is obtained by keeping the lowest terms in 
an expansion in powers of (E' - V)/2mc\ The following relations are 
easily established: 


/i + 

V ^ 2mc* J 


(d • grad 7)(d 


2mc* 
p7 = 7p - 
p) = (grad 7) 


S 1 - 


E' -V 


grad 7 

P + id ■ [(grad 7) X p] 


With the help of these, (44.6) becomes 

- (grad 7) • (grad ^f's) + d • [(grad 7) X p\^a] (44.7) 

Further simplifications can be made if 7 is spherically symmetric. 
We use the relations 

(grad 7) • grad = -j- ^ 


grad7=--^r 


and note that F' - 7 is approximately equal to pV2m, the accuracy 
being sufficient to replace the second-order term {E 7)p® in (44.7) 
by ^*/2m. We can then rewrite (44.7) as 




Pl--^-h7- 

2m 8m»c» ^ 


h'^ dV d , 
4m®c® dr dr 


^ A— S • L 

2m Vr dr 


] 




(44.8) 


where now S = and L = r X p» 

The first and third terms on the right side of (44.8) give the non- 
rdativistic Schrodinger equation. The second term has the form of the 
classical relativistic mass correction, which can be obtained by expanding 
the square root of (42.2) : 

F = F - mc“ = (cV + wi*c*)» - 


The last term is the spin-orbit energy (38.14), which is now seen to 
appear as an automatic consequence of the Dirac equation. The fourth 
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The equatioas for / and g are 

^ 0 

P \ot P/ 

p \ci p; 

We look for solutions of (44.19) in the form of power series 

/ = p®(ao + dip +■•*)> flo 0 

g — P®(&o + bip + • ■ • ), bo 0 

Since (44.14) is supposed to be finite at r = 0, we expect that a is greater 
than or equal to one. However, in analogy with the solution of the 
Schrodinger relativistic equation (42.16) for the Coulomb field, we shall 
admit a value of a slightly less than one, since the volume integral of 
will still be finite. 

We substitute (44.20) into (44.19), and equate the coefficients of 


(44.19) 


(44.20) 


(fi + + fr)b» — bp-i — yap — — = 0 

"" (44.21) 

(5 p Jc)dy •“ dp^i 4" ybp bp^i = 0 

OL 

for p > 0, When « 0, the equations analogous to (44.21) are 


{s + k)bo — 7^0 = 0 
(fi — k)ao + 760 = 0 


(44.22) 


Equations (44.22) have the required nonvanishing solution for ao and 
60 only if the determinant of their coefficients vanishes; this gives 

s = ± (P - 7^)^ (44.23) 


Because of the boundary condition at the origin, we take the upper sign 
for 8 in (44.23). 

A relation between ap and bp can be obtained by multiplying the first 
of Eqs. (44.21) by a, the second by ori, and subtracting. 

6 p[q '(8 + V + fc) — 0:17] = a„[ai(s + v — fr) + ay] (44.24) 

where use has been made of (44.16). We can now examine the behavior 
of the solution at large r. Unless both the series (44.20) terminate, this 
behavior is determined by their high terms, so we can neglect constant 
factors in comparison with v. We then obtain from (44.21) and (44.24) 


2 


2 
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This means that both series have the asymptotic form and regular 
solutions are obtained only if they terminate. Suppose that thfe occurs 
at = n', so that On'+i = 6fi'+i = 0. Then both Eqs. (44.21) give the 
relation 

aiOnf = —oibft', n' = 0,1,2, , . . (44.25) 

We obtain energy levels by setting v = n' in (44.24), and making use 
of (44.26). With the help of (44.16), we find that 

2a(s + n') = 7 (q!i — CC 2 ) = 

The square of this is 

(m^c^ - E^){s + ny = 
which is easily solved to give 

Equations (44.23) and (44.26) are equivalent to the formula first de- 
rived by Sommerfeld^ on the basis of the old quantum theory. This 
formula accoimts quite well for the spectrum of hydrogen.* The fine 
structure is made evident by expanding (44.26) in powers of y*. The 
result to terms of order 7* resembles (42.21) but is not quite the same: 


where n =* n' + lA"! is the total quantum number of Eq. (16.14), and 
\k\ can take on positive integer values. The total spread in energy of 
the fine-structure levels for a given n is easily seen from (44.27) to be 

mc^y^ n ~ 1 
2n 


This is substantially less than the value (42.22) obtained from the 
SchrSdinger relativistic equation, and is in agreement with experiment. 

Classification of Energy Levels. For n' > 0, all positive and nega- 
tive integer values of k are permissible [we saw from (44.12) that k can- 
not be zero]. For n' = 0, however, a contradiction can arise between 
(44.22) and (44.25) ; these give 


and 

Cq s k flo cx. 


(44.28) 


1 A. Sommerfeld, Ann. d. Physik, 61, 1 (1916). 

® Small deviations from the Sommerfeld-Dirac fine-strueture formula have recently 
been established by W. E. Lamb, Jr., and R. C. Retherford , Phys. Rev.^ 72, 241 (1947). 
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respectivdy. Since s < |/r|, the first of these expressions is positive or 
negative according as A* is positive or negative, whereas the second is 
always negative. Thus for — 0, A* can assume only negative integer 
values. 

Thus far we have only shown that the 3 value of a level is equal to 
[A-l — In order to connect I with the level, we must make the non- 
relativistic approximation that the orbital angular momentum is well 
defined. Since in this case 0 in (44.14) is much larger than F, we can 
replace by — 1 and d' by d in (44.10). Now in this approximation, 
(L + ihdy = [1(1 + 1) + iW + Ad . L, and is also equal to j(j + l)h\ 
We obtain in this way 

As an example of the energy levels in hydrogen, we consider the case 
w = 3. The radial quantum number n' can be 0, 1, or 2 , and k can be 
±(3 — nO except that k can be only —3 when n' = 0. The levels 
with their nonrelativistic classifications are 


n' 

k 

1 

i 


0 

-3 

2 


‘Di 

1 

2 

2 

1 


1 

-2 

1 

I 

‘Pi 

2 

1 

1 

i 

‘Pi 

2 

-1 

0 

i 

‘Si 


According to (44.23) and (44.26), states with the same lA*] or j have the 
same energy; Eq. (44.27) shows that the energy increases with increasing 

i^i- 

Negative Energy States. We have seen that both the Schrodinger 
and Dirac relativistic equations admit of solutions for which a particle 
has negative kinetic energy and negative rest mtiss. These solutions 
correspond to the negative square root of the classical energy equation 
(42.2). The negative-energy solutions cannot be ignored in the quantum 
theory, as they are in the classical theory, since there is nothing to prevent 
a charged particle from making a radiative transition from a state of 
positive energy to a state of negative energy. 

Dirac proposed that we regard the negative energy states of TCq, 
(43.22) as being full, in which case the exclusion principle prevents transi- 
tions into such occupied states. The normal state of the vacuum then 
consists of an infinite density of negative-energy electrons. It is assumed 
that there are no electromagnetic or gravitational effects of these elec- 
trons, but that deviations from the norm produced by emptying one or 
more of the negative energy states can be observed. The absence of a 
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negatively charged electron that has negative mass and kinetic energy 
would then be expected to manifest itself as a positively charged particle 
that has an equal positive mass and kinetic energy. In this way, a 
'*hole’^ theory of positrons can be formulated. 

With so many electrons present, however, the theory is no longer the 
one-particle theory contemplated when the wave equation was set up. 
A many-particle theoiy can be based on the Dirac equation in accordance 
with the formalism of quantized fields discussed in the next chapter, and a 
theory of positrons can be developed. 

It might at first be thought that a similar technique cannot be applied 
to the Schrodinger relativistic equation, since it describes a particle of 
zero spin, which we expect to obey Einstein-Bose statistics rather than 
the exclusion principle. However, Pauli and Weisskopf^ have shown 
that the quantized field energy is always positive in this case, even though 
the parameter E in the wave equation can be either positive or negative. 
Moreover, the charge in the quantized field can have either sign, cor- 
responding to the ambiguity of the sign of P noted after Eq. (42.8). 
Thus both the theories discussed in this chapter predict the existence of 
particles that have positive energies and both signs of electric charge. 
The appearance of spin angular momentum as a consequence of the Dirac 
equation shows that this is the theoiy that describes electrons. 

Problems 

1. Show that the exportation vahiea of i'j® and p® for a general wave-packet solu- 
tion of Eq. (42.4) satisfy th(^ (‘(luation {E^) ==■ -h Diseuss the connection 

between this result and the classical cciuaiion (42.2). 

2, Use the nonrclativistic approximation implied in (42.11) and in the immedi- 
ately following diacMiasion to show that the expression (42.8) for P reduces to (7.1) m 
the proper limit. 

8- Solve the S(‘-hrr)ding(‘-r rcdativistic equation for an attractive square well poten- 
tial of depth 7o and radius a. Obtain an explicit expression for the minimum 7o with 
given a that just binds a pariiede of mass m. 

4. Show explicitly that the wave functions (43.17) and (43.18) are not eigen- 
functions of any cotni)<)n(mt of the sijin angular momontuni J^d'. 

6. Show tliat any matrix with two rows and columns can be expressed ns a linear 
combination of o-,, o-., and 1. Use this rea\ilt to show that there is no matrix that 

onticommutes with each of the fimt three of these. 

6. Show that th(^ (mrrent (hm.sity given by (43.20) for a free-particlo wave func- 
tion agrees with the corn‘Hi)onding noiindativistic expression in the proper limit. 

7. Make use of E(is. (43.11), (43.13), and (43.10) to verify Eq. (43.24). 

8. Prove that the opc^rators arHiid k delined by Eqs. (^14,9) and (44.10) commute 
with each other, and that is given by the right side of (44,12). 

9- Discuss tlie connec.tion between the « • E term in Eq. (43.27) and the spin-orbit 
energy. 

1 W. Pauli and V. Weisskopf, Hdv. Ada, 7, 700 (1034). 
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10. Show that the negative square roots that could appear in arriving at Eqs. 
(42.16) and (44.26) actually do not correspond to bound states. 

11. Show explicitly that elimination of 6o/ao between the two Eqs. (44.28) gives 
the correct energy levels forn' = 0 if A <0, but not if A > 0. 

12. Use the selection rules AZ » ±1, Aj « 0, ± 1 to list the frequencies of the 
allowed transitions between the states with n “ 2 and n =» 3 for the Coulomb field, in 
both the Schrfidinger and Dirac relativistic theories. In particular, show that the 
latter theory gives seven lines, of which five are distinct, whereas the former gives 
three lines that are much more spread apart. 

18. Solve the Dirac equation for an attractive square well potential of depth Fo 
and radius a. Obtain an explicit expression for the minimum V o with given a that just 
binds a particle of mass m, and compare with the answer to Prob. 3. 



CHAPTER XIII 

THE QUANTIZATION OF WAVE FIELDS 


The theory of quantum mechanics presented thus far in this book has 
dealt with systems that, in the classical limit, consist of material particles. 
We wish now to extend the theory so that it can be applied to the electro- 
magnetic field, and thus provide a consistent basis for the quantum theory 
of radiation. The quantization of a wave field imparts to it some particle 
properties; in the case of the electromagnetic field, a theory of light 
quanta (photons) results. The field quantization technique can also 
be applied to a ^ field, such as that described by the nonrelativistio 
SchrSdinger equation (6.16) or by one of the relativistic equations (42.4) 
or (43.3). As we shall see (Sec. 46), it then converts a one-particle 
theory into a many-particle theory; in the nonrelativistic case, this is 
equivalent to the transition from Eq. (6.16) to (16.1) or (32.1). Because 
of this equivalence, it might seem that the quantization of ^ fields merely 
provides another formal approach to the many-particle problem. How- 
ever, the new formalism can also deal as well with processes that involve 
the creation or destruction of material particles (radioactive beta decay, 
meson-nucleon interaction). 

This chapter and the next are intended to serve as an introduction to 
quantum field theory.^ We start in Sec. 46 with a discussion of the clas- 
sical and quantum equations of motion for a wave field, without specify- 
ing the detailed nature of the field. The application to Eq. (6.16) is 
used as a first example in Sec. 46, since the analysis is relatively simple 
and uncomplicated by considerations of relativity. The quantization 
of the Dirac equation (43.3) is carried through in Sec. 47. Several other 
particle wave equations (including the relativistic Schrodinger equation) 
have also been quantized; they are mainly of interest in connection with ^ 
meson theory, and are not discussed here. The electromagnetic field is 
considered in the next chapter. 

^For further discussion, see G. Wentzel, “Einftthning in dio Quantentheorie der 
Wellenfeldcr,” (Franz Dcutickc, Vienna, 1943; reprinted by Edwards Bros, Inc,, 
Ann Ajbor, 1946; English translation published by Interscieiice, Now York, 1948); 
W. Heisenberg, “The Physical Principles of the Quantum Theory,” Appendix, Secs. 
9-12 (University of Chicago Press, Chicago, 1930); P. A. M. Dirac, “The Principles of 
Quantum Mechanics,” 2d ed., Chapa. XI, XIII (Oxford, New York, 1936). 

829 
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46* CLASSICAL AND QUANTUM FIELD EQUATIONS 

A general procedure for the quantization of the equations of motion of 
a classical system was obtained in Sec. 23. We start with the Lagrangian 
function for the system, and verify that it gives the correct classical 
equations. The momenta canonically conjugate to the coordinates of 
the system are found from the Lagrangian, and a Hamiltonian function is 
set up. The classical Hamiltonian equations of motion are then con- 
verted into quantum equations by the substitution of commutator 
brackets for Poisson brackets. We now show how this procedure can 
be applied in its entirety to a wave field which we assume for the 

moment to be real.^ 

Coordinates of the Field. A wave field is specified by its amplitudes 
at all points of space and the dependence of these amplitudes on the 
time, in much the same way as a system of particles is specified by the 
positional coordinates qi and their dependence on the time. The field 
evidently has an infinite number of degrees of freedom, and is analogous 
to a system that consists of an infinite number of particles. It is natural, 
then, to use the amplitudes ^(r,f) at all points r as coordinates in analogy 
with the particle coordinates qiit) of Sec. 23. 

It is not necessary, however, to proceed in this way. As an alterna- 
tive, we can expand \p in some complete orthonormal set of functions wa-: 

= SaA(0^^fc(r) (45.1) 

The expansion coefficients a* in (45.1) can be regarded as the field 
coordinates, and the field equations can be expressed in terms of either 
or the a*. We shall use the wave amplitudes at all points as the field 
coordinates in this section. It will be convenient for some of the later 
work to make use of the coefficients a*. 

Lagrangian Equation. The Lagrangian Liqi^qi^t) used in Sec. 23 is a 
function of the time and a functional of the possible paths qi{t) of the 
system. The actual paths are derived from the variational principle 
(23.3): 

d f Ldt = 0, 5gt(ii) = 8qi(t2) = 0 

Ji\ 

By analogy, we expect the field Lagrangian to be a functional of the 
field amplitude i/'(r,^). It can usually be expressed as the integral over all 
space of a Lagrangian density L: 

L = /L(\^, grad t)dT (45.2) 

^ W. Heisenberg and W. Pauli, Zeits.f. Phyaik^ 66, 1 (1929); 69 , 168 (1930). 
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where ^ = d\p/dt The appearance of grad ^ as an argument of L is a 
consequence of the continuous dependence of ^ on r (continuously infinite 
number of degrees of freedom); higher derivatives of \l/ could also be 
present, but do not seem to arise in problems of physical interest. The 
variational principle that corresponds to (23.3) is 

S £ Ldt = S j*' I Ldtdr = j'* j {hL)mr = 0 (46.3) 

where the infinitesimal variation of \l/ is subject to the restrictions 

bHUi) = = 0 (46.4) 

If L has the form indicated in (46.2) , its variation can be written 

SL = ^ 8^1^+ y - 8(^\ +^84> (45.6) 

d{d\l//dx) \to/ dyp 

xyz 

where the summation over x, ?/, z implies the sum of three terms with y and 
z substituted for x. Now 5^ is the difference between the original and 
varied yj/, and hence is the time derivative of the variation of This 
and the similar expression for 8{dyl//dx) can be Avritten 

« (II) = -E- 

Equation (45.3) then becomes 

r / [i + X mk I 

The summalion terms in (45.0) can be integrated by parts with respect 
to the spacie coordinat.es; the surfa(!c terms vanish, either because ^ falls 
off rapidly enough at infinite distance, or because i/' obeys periodic 
boundary conditions at the walls of a large but finite box. The last term 
of (45.6) can be integrated hy parts with respect to the time, and the 
boundary terms vanish because of (45.4). Equation (45.0) cun therefore 
be written 

/■“ /-te - y ^ r^i_] - 1 . 0 (46.7) 

■xyz 

Since (46.3) is valid for an arbitrary variation 5^ at each point in space, 
Eq. (45.7) is equivalent to the differential equation 

dL d [ dL _ d /dL\ _ 

Lidx ld{dxP/dx)\ Jt \byl^) 


(46.8) 
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Equation (46.8) is the classical field equation derived from theLagrangian 
density L{^^, grad 

Functional Derivative. In order to pursue further the analogy with 
particle mechanics, it is desirable to rewrite Eq. (46.8) in terms of L 
rather than L. Since the aggregate of values of ^ and ^ at all points is 
analogous to the and qt of particle theory, we require derivatives of L 
with respect to and ^ at particular points. These are called funcHondl 
derivatives, and are denoted by SL/S4' and Expressions for them 

can be obtained by dividing up all space into small cells and replacing 
volume integrals by summations over these cells. The average values of 
quantities such as grad yp, and ^ in the ith cell are denoted by subscripts 
i, and the volume of that cell by 8rj. Then 

^ (grad ^)i, 

t 

approaches L in the limit in which all the hri approach zero. 

In similar fashion, the t integrand in Eq. (45.6) or (45.7) can be 
replaced by 


i i 

where the variation in L is now produced by independent variations in the 
and the Suppose now that all the and dipt are zero except for a 
particular dipj. It is natural to relate the functional derivative of L with 
respect to ip for a point in the jth cell to the ratio of dL to dip^; we therefore 
define 

dij dJb dLr d r dir 1 g\ 

dip sri-*Q SipjdTj dip dx\^d{dip/dx)\ 

xyz 

Similarly, the functional derivative of L with respect to ip is defined by 
setting all the dpi and dpi equal to zero except for a particular dpf 


dJL dL dL 

— = lim -- — = — 
dp Srj^O dpfdTj dp 


(45.10) 


Here again the point r at which the functional derivative is evaluated is in 
the jth cell. Substitution of (46.9) and (45.10) into (45.8) gives 


^ 

dt dp dp 


(46.11) 


which closely resembles the Lagrangian equations (23.4) for a system of 
particles. 
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Hamiltonian Equations. The momentum canonically conjugate to 
can be defined as in particle mechanics to be the ratio of SL to the infini- 
tesimal change S^j when all the other and all the are zero. We 
thus obtain 


Pi 


Ik 



(45.12) 


It follows from (45.11) and (46.12) that 


The analogy with Eq. (23.5) then gives for the Hamiltonian 


H = 




I 


— L 


(46.14) 


We write H as the volume integral of a Hamiltonian density Hy and 
assume that the cells are small enough so that the difference between a 
volume integral and the corresponding cell summation can be ignored; we 
then have 

H = f Hdr, H = -L, t = ^ (45.15) 

J H d\l/ 


The approximate Hamiltonian (45.14), with the relations (46.12) and 
(45.13), can be manipulated in precisely the same way as the Hamiltonian 
for a system of particles. Instead of showing this explicitly, we now work 
with the true field Hamiltonian H given in (45.15), which is a functional 
of ^ and TT from which yf/ has been eliminated. The classical Hamiltonian 
equations of motion will be derived without further recourse to the cell 
approximation. The variation of L produced by variations of ^ and yp 
can be written, with the help of (45.11) and (45.15), 


8L 


f ^ ^ = J {irdyp Trby}i)dT 

= J[5(W') + ■— 

= hH -1- 5L + J {irbyf/ — yl/8nc)dr (45,16) 


The variation of II produced by the corresponding variations of yf/ and t 
can be written 
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It follows from the earlier discussion of functional derivatives that 


dyj/ 2^ dx d{d\lf/dx) 

xyz 

iE = ^ - V — — 

dr dr 2^ dx d(dr/dx) 
xyt 

Comparison of Eqs. (46.16) and (46.17) for arbitrary variations 
Sir then gives the classical field equations in Hamiltonian form 


(46.18) 

5V' and 


, 0H . m 


(46.19) 


The Hamiltonian equation for the time rate of change of a functional 
F oi^l/ and r can now be found. We express F as the volume integral of 
the corresponding functional density which for simplicity is 

assumed not to depend explicitly on the time or on the gradients of ^ or 
TT. The foregoing analysis can be used to show that 



= [F,H] (46.20) 

This equation also serves to define the Poisson-bracket expression for two 
functionals of the field variables. The right side of Eq. (45.20) is not 
changed if F also depends on grad ^ or grad tt (see Prob. 2). It is appar- 
ent from (45.20) that ff is a constant of the motion if it does not depend 
explicitly on the time; in this case, H is the total energy of the field. 

Quantum Equations for the Field. The analogy between particle 
coordinates and momenta qi^pi and the cell averages suggests that we 
choose as quantum conditions for the field 

[ht'Pi] = [Pi,Pj\ = 0, ['l^i,Pi\ = iliSij (45.21) 

where the commutator brackets are defined by Fa\. (23.9). This means 
that we have converted the wave field from a numerical function to an 
operator that can be represented by a matrix as in Sec. 23. 

We now assume that the cell volumes are very small. Then Eqs. 
(45.21) can be rewritten with the help of (45.12) and (45.15) in terms of 
V' and tt: 

[V'(r,0,V'(r',i)] = [v(r,i),v(r',«)] = 0 

[V'(r,0,ir(r',i!)] = «'A5(r,r') 
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where 6(r,r') = l/dn if r and r' are in the same cell and zero otherwise. 
The function 5(r,r') has the property that J'/(r)5(r,r')dT is equal to the 
average value of / for the cell in which r' is situated. Thus in the limit in 
which the cell volumes approach zero, 5(r,r') can be replaced by the three- 
dimensional Dirac 5 function 5(r — r'), defined in Eq. (11.14). The 
quantum conditions for the canonical field variables then become 


[i//(r,0,x(r',0] = - r') 


(45.22) 


The equation of motion for any quantum dynamical variable F is 
obtained as in Sec. 23 if the Poisson bracket in Eq, (45.20) is replaced by 
the commutator bracket divided by ih. 


ifiF = [F,H] 


(46.23) 


The commutator bracket can be evaluated with the help of (46.22) when 
explicit expressions for F and H in terms of ^ and tt are given. Thus Eqs. 
(45.22) and (45.23) completely describe the behavior of the quantized 
field that is specified by the Ilamiltonian H, 

Fields with More than One Component. Thus far in this section we 
have dealt with fields that can be described by a single real amplitude. If 
the field has more than one component i/'i, 1 /^ 2 , ... , the Lagrangian den- 
sity has the form I(^i, grad \l/i, i/'i, 1 /^ 2 , grad ^ 2 , - . - ,i). Then if 
each of the field components is varied independently, the variational 
equation (45.3) leads to an etiiiation of the form (45.8) or (45.11) for each 
of ypiy . . . . A momentum canonically conjugate to each can be 
defined as in Eq. (45.15) to be 7r« = dL/d\l/g. The Hamiltonian density 
has the form 

(45.24) 


if = ^ - L 


and the Hamiltonian e(iuati()ns (*,onsist of a pair like (45.19) for each s. 
Equation (45.23) is unchanged, and the commutation relations (46.22) are 
replaced by 

fi/'.(r,/),i/.«/(r',/)] = [7r«(r,^),7r«/(r',0] = 0 
= ih8,,^8(T - P) 


(45.26) 


A case of immediate interest is a single complex field 1 /', for which we 
can write 

rp = 2"i(’/'i + ^ ' 1 ^ 2 ) (45.26) 

where \pi and ^2 are real. We show first that the equations of the form 
(45,8) obtained by independent variation of i// and of ^ arc equivalent to 
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those obtained by variation of and ^ 2 . It follows from (45.26) that 



Thus the sum and difference of the ^1 equation and i times the ^2 equa- 
tion give the Lagrangian equations that result from independent varia- 
tion of if and respectively, in (45.3). In similar fashion, the momenta 
canonically conjugate to ^ and ^ are seen to be 

TT — 2-i(7ri — zVa) and it = + m) (45.27) 

respectively.^ Then tti^i + and the Hamiltonian is 

unchanged. The commutation relations for if, t, and fr can be obtained 
from Eqs. (45.25) (with 5 = 1 and 2), together with (45.26) and (45.27). 
All pairs of variables except the following commute: 

[^(r,<),7r(r',0] - i^5(r - r') ... 28) 

= ^‘A5(r - r') ^ * 

46. QUANTIZATION OF THE NONRELATIVtSTIC SCHR6DINGER EQUATION 
As a first example of the application of the field quantization technique 
developed in the preceding section, we consider here the quantization of 
the nonrelativistic Schrodinger equation (6.16). This application implies 
that we are treating Eq. (6.16) as though it were a classical equation that 
describes the motion of some kind of material fluid. As we shall see, the 
resulting quantized field theory is equivalent to a many-particle Schr5- 
dinger equation, like (16.1) or (32.1). For this reason, field quantization 
is often called second quantization; this term implies that the transition 
from classical particle mechanics to equation (6.16) constitutes the first 
quantization. 

Lagrangian and Hamiltonian Equations. The Lagrangian density 
may be taken to be 

L = iftif^ — A- grad if • grad ^ (46.1) 

As shown at the end of the last section, yp and ^ can be varied separately to 
obtain the Lagrangian equations of motion. The equation of the form 
(46.8) that results from variation of yp is 

-iM = + 7(r,0tf 

which is the complex conjugate of Eq. (6.16). Variation of ^ gives Eq. 
(6.16): 

1 See footnote 1, page 337. 


Sdo. 46] 


THE QUANTIZATION OF WA7E FIELDS 


337 


= -^VV+ (46.2) 

The momentum canonically conjugate to yp is 

r = (46.3) 

d\y 

However ^ does not appear in the Lagrangian density, so that f- is identi- 
cally zero, ^ It is therefore impossible to satisfy the second of the commu- 
tation relations (45.28) (or the corresponding classical Poisson-bracket 
relation), so that cannot be regarded as a pair of canonically conjugate 
variables. They can easily be eliminated from the Hamiltonian since ff 
never appears and Eq. (46.3) gives ^ in terms of tt.^ 

The Hamiltonian density is 

H = mp - L == - ^ grad t • grad ^ Vinp (46.4) 


The Hamiltonian equations of motion obtained from (45.19), with the 
help of (45.18), are 




= -Vir 
h 




The first of these equations is the same as (46.2), and the second equation, 
together with (46.3), is the complex conjugate of (46.2). We have thus 
shown, from the point of view of classical field theory, that the Lagrangian 
density (46.1), and the canonical variables and Hamiltonian derived from 
it, are in agreement with the wave equation (6.16) or (46.2). 

Quantum Equations. The quantum equations are obtained by 
adopting the volume integral of (46.4) as the Hamiltonian, (45.23) as 
the equation of motion, and the fii-st of Eqs. (45.28) as the quantum condi- 
tion on the wave field. Since p is now an operator rather than a nu- 

1 The notation of Eqs. (46.27) do(^s not necicsaarily imply that tt is the complex 
conjugate of tt, since these quantities were definod as canonical momenta. It is not 
difficult to see that t and x are actually complex conjugates of each other if Zr is real; 
in this case the xi and xa that appear in (46.27) arc also real. 

^ The conclusion that x can be identified with ^ is related to the appearance of only 
the first-order time derivative in the wave equation (46.2), since in this case \p can be 
expressed in terms of \l/ and its space derivatives through the wave equation. If the 
wave equation is of second order in the time derivative, \J/ and p are independent; then 
X is related to J rather than to if, and both ^,x and ^,x are pairs of canonical variables. 
The nonrelativistic SchrOdinger equation and the Dirac equation are of the former 
type, while the relativistic SchrOdinger equation is of the latter type. 
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merical function, ^ is to be interpreted as the Hennitian adjoint of ^ rather 
than as its complex conjugate. We therefore follow the notation of Sec. 
21 and denote it by V'*- It is convenient to make use of Eq. (46.3) to 
replace x by ih!/*, in which case the Hamiltonian becomes 

^ = / grad yj,* . grad ^ + Vyp*i^dr (46.6) 

Application of Eqs. (21.14) and (21.15) then shows that H is Hennitian. 
The quantized Hamiltonian given in (46.5) is the operator that represents 
the total energy of the field; it is not bo be confused with the operator 
given in (22.2), which is the energy operator for a single particle that is 
described by the wave equation (6,16) or (23.1). We have as yet given 
no explicit representations for the new operators and Hy and therefore 
cannot say on what they operate. The choice of a particular representa- 
tion is not necessary so far as the equations of motion are concerned, but 
is desirable in the physical interpretation of the formalism that we give 
later in this section. 

The commutation relations are 


[’/'(r),^(rO] = [^*(r),iA*(r')] = 0 

[V'(r),V^*(r')] - 5(r - r') 


(46.6) 


The omission of t from the argument of the field variables implies that 
both fields in a commutator bracket refer to the same time.^ 

Substitution of for F in Eq. (45.23) gives 


ihyl/ 


.f,H] 

/i j (46.7) 


where primes indicate that an integration variable r' has been substituted 
for r. The second term on the right side is easily evaluated with the 
help of (46.6) to give 


= J7V'5(r - r')dT' = (46.8) 

yp commutes with 7, which is a numerical function. Evaluation of the 
first term on the right side of (46.7) is simplified by performing a partial 
integration on Jgrad' ■ grad' xl/'dr' to obtain - the surface 

terms vanish because yj/ either vanishes at infinity or obeys periodic bound- 


1 Coi^utation relations between quantities that refer to different timoH Jire of 
interest in connection with relativistic theories (see Sec. 47). 
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aiy conditions. We thus obtain 
[^y/grad' 1 ^*' • grad' }p^dr'] = — 

= --J(V'y)5(r - r')dT' = -vV (46.9) 

Substitution of (46.8) and (46.9) into (46.7) yields Eq. (46.2), so that the 
equations obtained from classical and quantum field theory agree. A 
similar calculation shows that the equation yields the 

Hermitian adjoint of Eq. (46.2); it can also be seen directly that this 
equation is the Heinaitian adjoint of the equation ih\l/ = so long as 
H is Hermitian. 

If y is independent of t, H has no explicit dependence on the time, 
and Eq. (45.23) shows that 7/ is a constant of the motion. Thus the 
energy in the field is constant. Another interesting operator is 


N = 

which we assume represents the number of particles in the field. We note 
first that N is Hermitian. Its time derivative is given by 


zhN = 


N,H] 


j J grad' i/'*' • grad' dr' (46.10) 


The commutator of N with the V part of II can bo written as 
with the help of (4r).()) ihe parenthesis in the integrand is 


= - r') - 

= - 5(r - r')]^ + - r') 


= 0 


since the 5 function vanishes unless r = r'. A similar but slightly more 
complicated calculation shows that 


grad' t//*' • grad' xf/'] = fi/'* grad' — (grad' i^*')^] * grad' S(r — r') 

The double integral of this over r and r' is zero. Thus E(i. (46.10) shows 
that JV is a constant of the motion. 

It can also be shown that the commutator brackets in (46.6) are con- 
stants of the motion, so that they are always true if they are true at a 
particular time. 
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The N Representation. We now specialize to a representation in 
which the operator N is diagonal. Since N is Hermitian, its eigenvalues 
are real. A convenient and general way of specifying this representation 
is by means of an expansion like (46,1) in terms of a complete orthonormal 
set of functions Wft(r), which we assume for definiteness to be discrete. 
We put 

if'W) = y ai(<)«*(r), = y a%{t)uk{i:) (46.11) 

K h 

where the Uh are numerical functions of the space coordinates and the a*, 
are operators that depend on the time. Equations (46.11) can be solved 
for the ua,: 


ajb(0 = /tZ*(r)^(r,i)dr, aj = iuk{t)yp'^{ryt)dr 


Thus if we multiply the last of the commutation relations (46.6) by 
^jfe(r)wz(r') on both sides and integrate over r and r', we obtain 

[a*(0, af(0] = J/wA(r)'Wi(r')5(r - rOdrdr' = hi (46.12) 


because of the orthonormality of the Uh. In similar fashion, it is apparent 
that ah and ai commute, and that aX and a* commute, for all k and Z. 
Substitution of (46.11) into the expression for N shows that 



(46.13) 


It is easily seen that each Nk commutes with all others, so that they can 
be diagonalized simultaneously. 

In order to find a representation in which N and all the Nh are diag- 
onal, we write each of the a* in the form 

ah = 2-4(0^* + iVh)i aX = 2-4(g/fc ~ ip*) (46.14) 

where qk and p* are Hermitian. This is always possible since Eqa. 
(46.14) can be solved to give 

Qh — 2-4(a* + aj), p* = — i2“4(afc — at) 

and these operators are evidently Hermitian. It follows from (46.12) 
that 

[S'AjS'd = [Pk,Vi\ = 0, [g*,p,] = ihi (46.16) 

and that 

Nh = i(p| + 5h) “ i (46.16) 

Equations (46.15) and (46.16) have the advantage over the equivalent 
earlier equations (46.12) and (46.13) that their solution has already been 
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obtained in connection with the theory of the linear harmonic oscillator. 
We now show that some of the results of Sec. 13 provide explicit matrices 
for the pjfc and g* that make the Nk diagonal. 

Connection with the Harmonic Oscillator. The quantum motion of a 
particle of mass m acted on by a force where x is the displacement 

from equilibrium, was discussed in Sec. 13 from the point of view of the 
Schr5dinger equation. As shown in Sec. 22 and 23, the solution of this 
problem is equivalent to the diagonalization of the energy matrix 

when the coordinate x and the canonically conjugate momentum p satisfy 
a commutation relation like (23.13) : 

[Xjp] — ih 

The energy eigenvalues were found to be given by Eq. (13.8) : 

n = 0,l,2, ••• 

The matrix for x in the representation in which the energy is diagonal is 
given by Eqs. (13.18).^ 

If now we identify x with g*, V ’with pkj and set h, m, and K equal to 
unity, we see at once that Nk + i can be identified with the energy of the 
oscillator and has the eigenvalues + -y, where n* is a positive integer or 
zero. Equations (13.18) then show that the matrix for g^ has the form 

(g*)n..n.+l = (46-17) 

with other matrix elements vanishing. A calculation similar to that 
which resulted in (13.18) can be used to show that the matrix for Ph has 
the foim 

(46.18) 

with other matrix elements vanishing. The matrices for a* and a* can 
then be obtained from (46.14): 

(afc)n*.n*+l = (a*)nfc+l.n* = (^ft + 1)* (46.19) 

All other matrix elements vanish. Equations (46.17), (46.18), and (46.19) 
imply that the nonvanishing matrix elements refer to pairs of states for 

^ These results can also be obtained by purely matrix methods, without explicit 
solution of the Schrodingcr equation; see Dirac, op. cU.^ Sec. 36, 



342 QVAl^TVM MECHANICS [Chap. XIII 

which all the other ni have the same values, since Phj dhj and at each 
commute with Ni for I k. 

The matrix for x in the harmonic-oscillator problem can be related to a 
set of wave functions Un{x) given by (13.13), such that 

Xnn' = Jun(x)xun'ix)dx. 

We also expect that the matrices for a*, and a* can be related in a 

similar way to some kind of quantities that play the role of wave functions 
in the quantized field theory. We call these quantities wave functionals 
^ of the numbers Uk] they can be thought of as one-column matrices that 
have the properties 


) 

“ 

^*{nif . . . jTijfj . . . . . . jnjgj . . . ) (46.20) 

“ (nje "b l)^5nini'! * ' * 5n*+l,n*' 

. . . ,n*, . . . )4^«, 

= nk^Snim' * * * lin*' 

in agreement with Eqs. (46.19). Equations (46.20) are equivalent to 
the statement that the 4^'s are orthonormal and satisfy the relations 

ah^ini, . . , yUk, . . . ) 

= 71*4^(72.1, ... ,72* - 1, ... ) (46 21) 

aj^(72i, . . . ,71*, . . . ) 

= {nja + 1 ) 44 ^( 721 , ... ,7?.* + 1, ... ) 

Physical Interpretation. Since we regard N as the operator for the 
total number of particles in the quantized field, it is reasonable to assume 
that W* is the operator for the number of particles in a state of the field that 
is described by the space function 72*(r). We thus have the result that 
a precise measurement of the number of particles in any state must 
be one of the positive integei-s or zero. Equation (46. 13) then shows that 
this also applies to the total number of particles. 

Although AT is a constant of the motion, iV* need not be. Substitu- 
tion of JV* for F in Eq. (45.23) gives 

ihNk = [alak.H] 

H can be expressed in terms of the a* by substituting (46.11) into (46.6). 


= ^ a^ai J grad Uj • grad ui 4- VujUi )rfr 


(46.22) 
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It is not difiS-Cult to show from (46.12) that a particular Nk is constant if 
and only if all the volume integrals in (46.22) are zero for which either 
y or I is equal to A*. These integrals are just the matrix elements of the 
one-particle Hamiltonian (22.2), so that the necessary and sufficient con- 
dition that iV* be a constant of the motion is that all such off-diagonal 
elements that involve the state Uk be zero.^ 

The case in which the Uk are eigenfunctions of (22.2) with eigenvalues 
Ek is of particular interest. The integrals in (46.22) are then EiB^i, and 
the field Hamiltonian becomes 

TI = ^ atakEu = ^ NA (46.23) 

k k 

This particular N representation is one in which H is also diagonal; the 
wave functional ^(ni, . . . ,nfr, . . . ) has the eigenvalue ^ UhEk for 

the total-energy operator H. It is apparent that all the Nk are constant 
in this case. 

The first of Eqs. (46.21) permits the interpretation of a* as a destruction 
operator for particles in the state k, since it converts a wave functional into 
a multiple of one that has one less particle in that state. Similarly, 
aj can be interpreted as a creation operator ^ since it increases the number 
of particles in the klh state by unity. 

Connection with the Many-particle Schrodinger Equation, The 
quantized field tlieory is closely related to the many-particle Schrodinger 
equation discussed in Sec. 32. If the Uk are eigenfunctions of the one- 
particle Hamiltonian (22.2) , the field theory shows that stationary solutions 
exist for whi(?h the number of particles Uk in the A'th state is a con- 
stant positive integer or zero, and th(^ energy is ^ Each solution 

k 

can be specified by a wave functional '4^(^i, . . . . . . ); the ''P^s 

form a complete set, and there is just one solution tor eacdi set ol numbers 
^ 1 , . . . . A stationary many-pai’ticle wave function like the ^ in 
Eq. (32.1) can be writtem as a pi'oduet of one-particle wave functions 

iEki 

iijt(r)e if there is no interaction botwecui particles. The linear com- 
bination of siKih products that- is symmetric with respect to interchange of 
any pair of particle (u)ordinat(^s can be spedfied uniquely by stating the 
number of partides in eacdi state. Again, the number of particles in 
each state is a i)()sitive int.(^g(M* oi- z(n‘(), and the energy is the sum of all 
the parti(!lo (mergies. 

iThia roBult for the quantized field is cltiHcdy ndatod to th(^ correnponcliuK result, 
contained in Kq. (2t).r)), for the oiu'-piirt icln i)rolml)ility amplitude. 
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We see then that the quantized field theoiy developed thus far in 
this section is equivalent to the Schrodinger equation for several non- 
interacting particles, provided that only the symmetric solutions are 
retained in the latter case. We are thus led to a theory of particles that 
obey Einstein-Bose statistics. It can be shown that the two theories are 
completely equivalent even if interactions between particles are taken 
into account. 1 

It is natural to see if there is some way in which the quantized field 
formalism can be modified to yield a theory of particles that obey Fermi- 
Dirac statistics. As discussed in Sec. 32, a system of such particles can 
be described by a many-particle wave function that is antisymmetric with 
respect to interchange of any pair of particle coordinates. The required 
linear combination of products of one-particle wave functions con be 
specified uniquely by stating the number of particles in each state, pro- 
vided that each of these numbers is either 0 or 1. The desired modifica- 
tion of the theory must, therefore, limit the eigenvalues of each operator 
Nh to 0 and L, 

Anticommutation Relations. A review of the foregoing theory shows 
that the conclusion that the eigenvalues of each Nk are the positive 
integers and zero stems from the commutation relations (46.12) for the 
ah and a*. Equations (46.12) in turn arise from the conunutation rela- 
tions (46.6) for ^ and ^*. Thus we must modify Eqs. (46.6) if we are to 
obtain a theory of particles that obey the exclusion principle. It is 
reasonable to require that this modification be made in such a way that 
the quantum equation of motion for ^ is the wave equation (46.2) when 
the Hamiltonian has the form (46.5). 

It was found by Jordan and Wigner^ that the desired modification con- 
sists in the replacement of the commutator brackets [A,B] ^ AB — BA 
in Eqs, (46.22) and (46.6) hy anticommutator brackets [A,B]+ = AB + BA. 
This means that Eqs. (46.6) are replaced by 

[^(r)»!/'(r')I+ = ^(r)^(rO + \^(rO^(r) = 0 
[^*(r),i^*(r')]+ = ^*(r)^*(rO + ^*(r')!/'*(r) = 0 (46.24) 

[^(r),TA*(r')]+ = + ^*(r')^(r) = 5(r - rO 

It then follows directly from Eqs. (46.11) and (46.24) that 

[akjai]^ = akGi + aiak = 0 

+ afa? = 0 (46.26) 

[a&,a?]+ = Ofcaf + afa* = 8ki 


^ See Heisenberg, op. di., Appendix, Sec. 11. 

» P. Jordan and E, Wigner, Zeits.f. Physik, 47, 631 (1928). 
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We define Nh = before, and notice first that each Nu com- 

mutes with all the others, so that they can be diagonalized simultaneouBly. 
The eigenvalues of Nk can be obtained from the matrix equation 

= a^aka^ak 

= at {I - atak)ak - aja* = Nk (46.26) 

where use has been made of Eqs. (46.25). If Nk is in diagonal form and 
has the eigenvalues n'k, n* , . . • , it is apparent that Nl is also in diagonal 
form and has the eigenvalues .... Thus the matrix equation 

(46.26) is equivalent to the algebraic equations 

«'*? = K, = <,••• 

for the eigenvalues. These are quadratic equations that have two roots: 
0 and 1. Thus the eigenvalues of each Nh are 0 and 1, and the particles 

obey the exclusion principle. The eigenvalues of iV = ^ JV* are the 

k 

positive integers and zero, as before. The earlier expressions (46.22) and 
(46.23) for the Hamiltonian are unchanged, and the energy eigenvalues 



Equation of Motion. In order to find the quantum equation of motion 
for V' when the Hamiltonian is given by (46.5), we must decide whether 
or not the general equation of motion (45.23) is to be retained. The latter 
equation was obtained by replacing the Poisson bracket by the com- 
mutator bracket in the classical equation (45.20). This replacement was 
justified by analogy with the particle theory of Sec. 23, by the identical 
algebraic properties of the two kinds of bracket expressions given in 
(23.12), and by the correspondence-theory argument of Prob, 10, Chap. 
VI. Thus abandonment of Eq. (45.23) means that the classical equation 
(45.20) is likewise abandoned; since many of the quantities with which we 
are concerned (number of particles, energy, etc.) have well-defined clas- 
sical analogues, we shall retain (45.23) as the general quantum equation 
of motion. 

The equation for is then given by (46.7), where now the anticom- 
mutation relations (46.24) arc to be used in evaluating the right side. 
This causes (46.8) to be replaced by 

J7'(^*y - ^*VV)dT' = 

= ;7V'5(r - r^)dr^ = V^p 


The similar treatment of the first term on the right side of (46.7) gives 
no change in the right side of (46.9). Thus the wave equation (46.2) is 
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unaffected by the substitution of anticommutation for commutation rela- 
tions between the wave amplitudes. It can also be shown without dif- 
ficulty that N and the anticommutator brackets in (46.24) are constants 
of the motion. 

Physical Implications of Anticommutation. Since anticommutator 
brackets do not possess the algebraic properties of Poisson brackets, we 
can conclude that there is no classical analogue for the quantities ^ and 
a* that satisfy the relations (46.24) and (46.26). This does not mean, 
however, that N and H do not possess classical limits, for they are con- 
structed of bilinear combinations of yp or a* and commute with each other. 

These conclusions can be confirmed on the basis of physical considera- 
tions. In order that a field amplitude be strong enough to be classically 
measurable, it must be possible to have a very large number of particles 
in the same state so that their fields are coherent. This implies that such 
particles obey Einstein-Bose statistics. We can, for example, conclude 
that light quanta or photons obey Einstein-Bose statistics, since it is 
known that strong electric and magnetic fields can be produced and are 
classically measurable. In the case of metallic electrons, which obey 
Fermi-Dirac statistics, quantities like energy, and charge and current 
density, are classically measurable since they can be expressed as bilinear 
combinations of the field amplitude, while the electron field amplitude 
itself is not.^ 

Representation of the Anticommuting a* Operators. An explicit 
representation for the operators that appear in (46.25) is easily obtained 
in the hypothetical but instructive case in which the system has only one 
state. The matrix equations to be solved are 

^2 _ ^*2 ™ ^ (46.27) 

Equations (46.27) are just those solved in Prob, 2, Chap. VI. We have 
already seen in (46.26) that iV^ = iV, so that N has the eigenvalues 0 and 
1. If there is no degeneracy, N can be represented by the diagonal matrix 

JV = J) (46.28) 

It is interesting to note that a cannot be diagonalized since N has a non- 
vanishing eigenvalue. If it could be, the first of Ecjs. (46.27) would show 

^ The classical measurability of the field amplitude for any charged particlo 
(Einstein-Bose or Eermi-Dirac statistics) implies that ^ appears lincai*ly in Hj since 
the energy must then depend on itself as well as on bilinear combinations of i//. This 
in turn implies that terms linear in a*, or aj appear in the Hamiltonian, so that single 
charged particles can be destroyed or created. Thus ^ cannot measured classically 
if the theory is such that electric charge is conserved. 
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that the square of each of its eigenvalues is zero; this would mean that a, 
and hence a* and iV, are identically zero, so that N could not have the 
form (46.28) in any representation. 

Explicit matrices for a and a* that agree wth (46.27) and (46.28) are 

“ = i) “*-(! 2) 

The two wave functionals that represent the two possible states of this 
system can be expressed as 

^(0) = (J). ^(1) = (J) (46.30) 

It is easily seen that the first of (40.30) has the eigenvalue 0 for the opera- 
tor N given in (46.28), and the second wave functional has the eigenvalue 
1. The following relations are easily obtained from Eqs. (46.29) and 
(46.30): 

a^(n) = n^(l — n), a*4'(n) = (1 — n)^(l — n), n = 0,1 (46.31) 

Thus a and a* again play the roles of destruction and creation operators, 
respectively. 

In the actual problem, the number of states of the system is infinite, 
and it is not convenient to write down explicit matrices like those in 
(46.28), (46.29), and (40.30). We can, however, find the effects of opera- 
tion with ctfe and a* on a wave functional 'P(ni, . . . . . . ) that has 

the eigenvalue (= 0 or 1) for the operator Nk. The desired relations 
would have the form (40.31), were it not that a scries of such equations 
(with subscripts added) would not agree with the first two of Eqs. (46.26). 

We therefore proceed in the following way. We order the states k of 
the system in an arbitrary but definite way: 1, 2, . . . , /r, . . , . Then 

the effect of operating with (^iich ak or a* on has the form (46.31), 

except that a multiplying plus sign or minus sign is introduced, according 
as the Hh state is preceded in the jissumcd order by an even or an odd 
number of occupied states. We thus replace Eejs. (40.21) by 

. ) 

= OkthA'ini, ... ,1 - ajfc, . . . ) 

= 0,{\. - . . . ,l - . . . ) (46-32) 

fc -1 

t 


afc^(ni, . . . .riky . . 
af^(ni, . . . ,74, . . 


Ok = (- 1 )-, 


Vk = 
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As an example, we calculate the effect of operating with a^ai and with 
aiajc on a wave functional where we assume for definiteness that the 
order is such that i > /r. If each operation is not to give a zero result, 
both n* and ni in the original ^ must equal unity. Operation with asaj 
empties first the Zth and then the A*th state, and introduces a factor diQk- 
Operation with aia^ empties the frth state first, so that dk is unchanged. 
But when the Zth state is emptied in this case, there is one less particle in 
the states below the Zth than there was in the previous case, since the /rth 
state is now empty, whereas it was occupied before. Thus the sign of 
6i is changed. We find then that in agreement with 

the first of Eqs, (46.25). In similar fashion, it can be shown that Eqs. 
(46.32) agree with the result of operating with the other two of Eqs. 
(46.26) on any Since the aggregate of wave functionals represents 
all possible states of the many-particle system, the constitute a com- 
plete set, and Eqs. (46.26) follow as operator equations from Eqs. (46.32). 

47. QUANTIZATION OF THE DIRAC EQUATION 

As our second example, we now consider the quantization of Dirac's 
relativistic equation (43.3) for a free electron. Our procedure again 
consists in treating the one-particle equation as though it were a classical 
field equation. The resulting quantized field theory represents the 
motion of a number of noninteracting free electrons. 

Lagrangian and Hamiltonian Equations. The Dirac wave function ^ 
has four components that we denote by {j = 1, 2, 3, 4). The Lagran- 
gian density may be taken to be 

L = (ihyp^ — ihc ^ an • grad + mc^ ^ (47.1) 

3 i i 

where the matrices ctji and ftz are given in Eqs. (43.12). We must now use 
the multicomponent extension of the field theory described at the end of 
Sec. 46. Variation of one of the components of ^ leads to an equation 
of the type (45.8) : 

mc^ ^ X grad • aji — ih\l/i = 0 (47,2) 

3 3 

The four equations like (47.2) may be written together as 
+ ihc grad ^ • a + = 0 

where is the Hermitian adjoint of ^ and is a matrix with one row and 
four columns ; this equation is the Hermitian adj oint of (43.3) . In similar 
fashion, it is easily shown that separate variation of the four fields 
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results in four equations that can be written together as 

ih\l/ — ihcQL • grad = 0 (47.3) 

which is the same as the Dirac equation (43.3). 

The momentum canonically conjugate to is 

iry = (47.4) 

As with the nonrelativistic SchrSdinger equation, the momentum itj 
canonically conjugate to is identically zero; we therefore eliminate 
from the Hamiltonian with the help of (47.4). Then the Hamiltonian 
density is 

H = ^ - L 

J 

= c ^ TTjaji • grad \pi + ^ (47.6) 

It is not difficult to show (see Prob. 12) that the first of the Hamiltonian 
equations (45.19) is the same as (47.3), and that the second Hamiltonian 
equation, together with (47.4), giyes the Hermitian adjoint of (47.3). 
This completes our demonstration that (47.1), (47.4), and (47.6) are in 
agreement with the Dirac equation. 

Quantum Equations. As with the nonrelativistic Schrodinger equa- 
tion, it is convenient to rewrite the Hamiltonian with the help of (47.4) 



= j{ihc\l/'^ot ■ grad ^J/ — 7nc^\l/^^^l/)dT (47.6) 

In spite of its appearance this expression is actually real, as can be seen 
by performing a partial integration on half of the first term to obtain 

H = ■ grad t/' — grad • a^) — (47.7) 

The surface terms vanish because \f/ either vanishes at infinity or obeys 
periodic boundary conditions. 

We have already seen (Sec. 32) that electrons obey the exclusion 
principle. We therefore quantize the field by imposing anticommutation 
relations on the components of \l/. With the help of (47,4) these become 

[UT)Mt')]^ = [^r(r),^f(r')]-H = 0 

[V'i(r),^z*(r')]+ = 8n8(r - r') 


(47.8) 



quantum mechanics [Chap, XIII 

The substitution of ypf for implies that each ypj is now a quantum- 
mechanical operator rather than a numerical function; we interpret to 
mean the matrix that has one row and four columns, whose elements are 
the operators 

The quantum equation for yj/ is obtained by substituting for F in 
Eq. (45.23), where H is given by (47.6) or (47.7). 

iMi = \fu f (ihc y ^Pt'du • grad' - mc^ V ft'M'i) o^t'I ( 47 . 9 ) 

Prunes indicate that an integration variable r' has been substituted for r. 
The second term on the right side can be evaluated with the help of (47.8) 
as follows: 

[4'i, f -mc^y^rM'idr'] 

= -mc^ y f 

= f - m',yPf)dr' 

kt 

= -mc*y f + 4>rMdT’ 

ki 

= —me* ^ PkiSjh J <A|3(r — r')dT' = —mc^ ^ 

Evaluation of the first term on the right side of (47.9) is accomplished in 
the same way, since i/-, anticommutes with grad' as well as with i/-', : 

[4'i, f ihc y V'raw • grad' f^dr'] = ihc V ctn ■ grad 

kl ^ 

Thus the four equations like (47.9) are equivalent to the Dirac equation 
■I /* ' ^ ®™iiar calculation shows that the four ecpiations of the form 

ih\pj = yield the Hermitian adjoint of (47.3). 

The operator N for the total number of electrons in the field can be 
written 

N = I i;*4,dT = / ^ 'I'i'l'idr (47.10) 

J 

It IS apparent that N is Hermitian, and it can also be shown that 

aiV = [iV,H] = 0 

so that N m a constant of the motion (see Prob. 13). As with the non- 
relativistic theory of See. 46, the anticommutator brackets in (47.8) can 
be shown to be constant in time. 
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The N Representation. A representation in which. N is diagonal is 
conveniently found by expanding ^|/ in plane wave solutions of the Dirac 
equation for a single free electron. The entire procedure is exactly 
analogous to that followed in the last section, but is complicated by the 
multicomponent nature of the Dirac field. We shall continue to denote 
the four components of t/' and of the plane waves by a subscript j or 1. 
The propagation vector of a plane wave (momentum divided by h) will 
be denoted by k, and it will be assumed that these waves obey periodic 
boundary conditions at the walls of a large cubical box of edge length L, 
We saw in Sec. 43 that there are four solutions for each value of k; 
these will be distinguished by the letter 5 ( = 1, 2, 3, 4), so that both k and 
s must be given in order to specify a one-electron plane wave solution. 
The free electron solutions that form a complete orthonoimal set of func- 
tions within the volume U are then 


?;,-(k,s;r) = ti,'(k,s)L“8 exp tk • r (47.11) 

The wy(k,s) are numbers obtained by multiplying the four sets of Ui given 
in Eqs. (43,17) and (43.18) through by the normalizing factor given there. 
The two solutions (43.17) will be denoted by s = 1, 2, and correspond to 
the two spin orientations for the positive energy; 


/?k, = 

-|-(fiV'^* + mV)i, 

s = 1,2 

(47.12) 

The two solutions (43.18) arc 

for negative energy ; 



/i’k. = 

-(/iVk» + 

a = 3,4 

(47.13) 


The orthonormalily Halations for the of Eq. (47.11) are easily seen to be 


J 2 ?iy(k,.s*;r)?v(k'/;r)f/T = 


(47.14) 


As in Eqs. (40. L i), we expiiiid i^/ and in terms of the vf, 

'I'iiU) = y r) 




ka 


(47,15) 


The coefficiciitH a and a* are (niantum.-mechanic!al operators that depend 
on the time. Witii the; help of the orthononnality ijroperty (47.14) it is 
not difficult to show that the anticommutation relations (47.8) are equiva- 
lent to 


[rt(k,,s;0,a(k',.>!';0]i- = fa*(k,.-!;0,a*(k',.s'; /)]+ = 0 
[a(k,.s;0,a*(k',s';0]+ = Sick'S,.' 


(47,16) 
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Substitution of (47.16) into (47.10) shows that 

N ~ where A(it. = o*(k,s;i)o(k,8;i) (47.17) 

ke 

The work of Sec, 46 shows that the eigenvalues of each iVks are 0 and 1, 
so that there cannot be naore than one electron in each state (which is 
specified by the spin orientation, the value of the momentum, and the 
sign of the energy). Explicit expressions for the a^s, analogous to (46.32), 
can also be written down. 

Substitution of (47.16) into the expression (47.6) for the field Hamil- 
tonian gives 

^ “ X / ~ mcV*/3/zWrfr 

a*(k,5;0a(k'js';^)^^y(k,5;r) 

•{ihcdii • grad — mc%i)viOL' ]t)dT 

The solution of Eqs. (43.16) shows that the Vj defined by (47.11) satisfy 
the equations 

y (ihcdfl • grad - ;t) = 

T 

We then obtain, with the help of (47.14), 




(47.18) 


where the are given by (47.12) and (47.13). It follows at once that 
the N^s are constants of the motion, since they commute with each other 
and hence with H. 

Negative Energy States and Positrons. All the results obtained thus 
far in this section can be obtained from the assumption that the and 
fit's commute, as well as from the assumption that they anticommiite. 
This makes it seem that the Dirac theory can describe either particles of 
spin that obey Einstein-Bose statistics, or electrons that obey the 
exclusion principle. It is easily seen, however, that the field energy 
operator (47.18) has negative eigenvalues of arbitrarily large magnitude, 
which correspond to electrons in negative energy states (s = 3,4). The 
existence of such eigenvalues implies that there is no equilibrium con- 
dition of the field when electromagnetic interactions are taken into 
account, since an electron will make radiative transitions to states of lower 
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and lower energy. This difficulty cannot be avoided so long as the 
particles are assumed to obey Einstein^Bose statistics. 

It was proposed by Dirac that the undesirable transitions to negative 
energy states be eliminated from the theory by the assumption that the 
normal condition of empty space is that in which all positive energy states 
are empty and all negative energy states are full: 

JVki = iVk 2 = 0, I for all k (47.19) 

This is an equilibrium condition of the field, since the exclusion principle 
prevents transitions into the negative energy states. It is also assumed 
that the infinite density of negative-energy electrons produces no observ- 
able electromagnetic or gravitational effects, but that departures from 
the vacuum values (47.19) are observable in the usual way. We there- 
fore subtract the vacuum values 



for the total electric charge and total energy, from the total charge and 
energy operators cN and H, respectively; e is the (negative) charge of an 
electron. 

The resulting expression for the total observable charge is 


k «-l,2 s-8,4 


ATi 


ka 


l-iVk3 = a(k,s;0a*(k,s;0 (47.20) 


The new operator iV'ka' has the eigenvalue 0 when the state ka is full and 
the eigenvalue 1 when that state is empty. The similar expression for the 
total observable energy is 


^ ^ J NW\Eu\) 


(47.21) 


According to (47.20), each positive-energy particle acts like a negative 
electron, and each missing negative-energy particle like a positive elec- 
tron; (47.21) then shows that the observable energy is positive, and equal 
to the sum of a positive term for each positive-energy particle and for 
each missing negative-energy particle. It is therefore reasonable to 
interpret the holes’^ in the otherwise occupied set of negative energy 
states as positive electrons or positrons. Because of the relation (47.13) 
between energy and momentum, positrons have the same rest mass as 
electrons. The existence of positrons was predicted by Dirac on the basis 
of this theory in advance of their discovery in cosmic radiation, ^ 

^ For further diBcnission of tho formal aspects of positron theory, see Wentzol, 
op. cxi., Chap. V. 
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We have seen that the Dirac theory, which describes particles of spin 
ih, can only be quantized in accordance with the exclusion principle. 
This is a special case of a general result, obtained by Pauli, ^ according to 
which particles of zero or integer spin obey Einstein-Bose statistics and 
particles of half-odd-integer spin obey Fermi-Dirac statistics. The con- 
nection between spin and statistics can be derived only for relativistic 
theories. For example, the negative energy states that prevent Einstein- 
Bose quantization of the Dirac equation appear only in a relativistic 
theory, and the nonrelativistic Schrddinger equation was successfully 
quantized in both ways in Sec. 46. 

Anticommutatioii Relations at Different Times. All the commutation 
and anticommutation relations that have been used up to now have been 
between quantities that refer to the same time. In a relativistic theory, 
however, there is some reason for being interested in relations of this kind 
between quantities that refer to different times. Such relations can be 
used to investigate the causal connections between events that occur at 
different times and places, and hence the relativistic character of the field 
theory as a whole. 

To see this, suppose that we have a physically observable quantity, 
such as particle or charge density, which can be represented by an opera- 
tor that depends on r and L It is interesting to inquire under what cir- 
cumstances the values of this observable at different space-time points 
can be measured without interference of one measurement with the other. 
Measurements without interference can be made if the two operators that 
represent the observable at the two points commute with each other, 
since then their matrices can be diagonalized simultaneously and precise 
results (eigenvalues) can be obtained from both measurements. We 
expect that observables at the same time and different space points will 
always commute, since there is no way in which an effect can be propa- 
gated over a finite distance in zero time. In a nonrelativistic theory, 
observables at different time and space points need not commute, for 
there is no limit to the speed at which effects can be propagated. In a 
relativistic theory, however, we expect that observal)les at different 
space-time points always commute if the spatial separation of the two 
points is greater than c times the time interval. Thus commutation or 
anticommutation relations at different times can provide a direct physical 
check on a relativistic theory, but are of little interest in connection with a 
nonrelativistic theory. 

The anticommutation relations (47.8) or (47.16) are conveniently 
extended to different times by finding the equations of motion for the 
a^s. We have from Eqs. (45.23), (47.16), and (47.18) that 

1 W. Pauli, Phys, Rev., 68, 716 (1940). 
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2 ftd(k,s; t) = [a(k,s; t)iH] = jBkaa(k,s] t) 
^*ftd*(k,s; t) = [a*(k,s; t)jH] = — jBjcffa*(k,s; i) 

These are easily integrated to give 

o(k,s;/:) = a(k,s;0) exp 

a*(k,s;<) = a’'‘(k,s;0) exp 


(47.22) 


Equations (47.16) apply when both the times that appear are the same; 
they can both be taken to be zero, in which case we can use (47.22) to 
obtain 


[a(k,s; 0.a(k',s'; t')]+ = [o*(k,a; 4)a*(k',s'; i')]+ = 0 
[a(k,s; <),a*(k',s'; t')]+ = Stk'3».' exp — - 


(47.23) 


Substitution of (47.23) into (47.15) enables us to calculate the anti- 
commutation relations for ^’s at different times. It is apparent that 

= mir,t),mr',t')]+ = 0 (47.24) 

The anticommutator bracket for ^ and becomes 

= y i),a*(k',s'; f')]+i^,-(k,s; r)5,(k',s'; r') 

kk^»' 

y,-(k,s; r)j),(k,.s; r') exp ~ (47.25) 

The last sum could be simplified at oneo by making use of the closure 
property 

^ 7/3(k,.s)wKk,«) = 5,7 (47.26) 

8 

which follows from (43.17) and (43.18), wore it not for the fact that E^a 
has different signs for different values of -s. 

As it is, we can still use (47.26) if Ave first roAvrite the exponential in 
(47,25) so that it does not involve explicitly. AVe put 




c-os 


yr 



Hin { \E^s\T/h) 
h 


where t = t — [Akal/A = +c(A’*-* + kl)^ is iiulepoiident of s, and 
fco = mefh. We (am AAO’itc this as 



sin ct{Ic^ + fcj)l 
h ) c(k^ + A’.g)i 
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The remaining factor E^b can be replaced by an operator: 

• grad — mc^rWiKs; r) 

Substitution into (47.25) gives, with the help of (47.26), 

~ i32(kjs; r) 

k,8 

^ d , j , . T « \ n sin ct(A* + /c8)* 

2j V ^ yikiifiii'J Vi'(k,s; r) — 

= (sfl ^ + can • grad + 


, “f“ coiji * grad 4“ hcJco^ji 


where 


*)j5(r - r', 


- t') (47.27) 




It can be shown that (47.27) reduces to the third of Eqs. (47.8) when 
= t (see Prob. 14). 

CommutatiorL Relation for the Charge Density, The charge density 
in the Dirac theory, without the modification (47.20) that results in 
positrons, is 

p(r,«) = #*(r,<)^(r,<) = e ^ (r,i)i^f(r.O 

3 

In order to discuss the extent to which measurements of p at different 
space-time points interfere with each other, we must calculate the com- 
mutator bracket 

[p(r,t),p(r',fO] = 

With the help of Eqs. (47.24) and (47.27), this can be reduced to 
[p(ri0;P(rS^0] = ^ ^ + ca,7 • grad + icko^j^ 

• D(r - r',^ - t') - H.A. 
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where H.A. signifies the Hermitian adjoint of the preceding term. In 
general, p(r,i) will commute with p(r',i') only when 2)(r — r', ^ — ^') is 
zero. 

We therefore examine the structure of the D function by replacing the 
k sum in (47.28) by an integral, as in Eq. (11 .14) 

( 47 . 29 ) 

The integration over the polar angles of k is readily carried out; the result 
can be written 

D(t^t) = (27rVc)“^ j + /cj)”"* sin kr sin ct{k^ + kD^dk 

— — {4^hc)’~^ ^ COB kr sin ct{k^ + fcj)idfc 

The substitution k = ko sinh x transforms this into 


d / “ 

Z)(r,i) = — (47rVc)"^-^ / cos {kor sinh x) sin (koot cosh x)dx (47.30) 
The integrand in (47.30) can be rewritten as 

i sin (koct cosh x + kdr sinh (c) + ^ sin {koct cosh x — kor sinh a;) (47.31) 

Further reduction depends on the relative magnitudes of ct and r. 
Suppose first that ct is positive and greater than r, which is always posi- 
tive; we can then put 

T 

kact cosh X + koT sinh x = ko{cH^ — cosh (a; ± 0), tanh ^ s — 

ct 

The integral in (47.30) then becomes 

i sin [z cosh (x + 6)]dx + ^ sin [z cosh (a? — 6)]dXf 

z s koicH^ — r®)^ (47.32) 

These have the form of one of the integral representations of a Bessel 
function;® 

1 /* " 

Jd{z) = ” / sin {z cosh x)dx 


I P. A. M. Dirac, Proc. Cavih. Phil Soc., 30, 150 (1934). 

* E. T. Whittaker and G. N. Wnteon, “A Course of Modern Analysis,” 4th ed., 
p. 382 (Cambridge, London, 1936). 
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The two integrals in (47.32) are equal, and their sum is then equal to 
tJ 0 ( 21 ) . It is apparent that for ct negative and less than — r, the integral in 
(47.30) is equal to — 7rJo(25). 

To consider the case in which ct lies between r and — r, suppose first 
that ct is positive and less than r. We then use a different expression for 
the arguments of the sines in (47.31): 

ct 

koct cosh X ± kor sinh x = ±Ao(r^ — cH^)^ sinh {x ± 0' s — 

The integral in (47.30) then becomes 

i ^ sin [z' sinh {x + B^)]dx — -J- sin [ 2 ' sinh {x — 6^)]dXj 

z' = k{){r^ — cH^)^ 

Each of these integrals vanishes, since the integrands are odd functions 
of a; + 0' or a; — 0'. We see then that the integral in (47.30) is equal to 

TrjQ[kQ{cH^ — r®)i] for ct > r 

0 for r > ct > —r (47.33) 

— 7rJo[fco(c®^^ — for —r > ct 

We have thus shown that there is no intei’ference between measure- 
ments of charge density at two different space-time points if and only if 
the spatial distance between the points is greater than c times the time 
interval. In this case there is no physical mechanism whereby a dis- 
turbance can be communicated from either point to the other. 

The hypersurfaces ct = ± r in four-dimensional spa(^e-time constitutes 
the light conCy which is the locus of all light pulses that pass throxigh the 
space point r = 0 at the time ^ = 0. An explkiit expression for Z)(r,/) 
infinitesimally close to the light cone can be obtained by noting that 
(47.33) changes discontinuously from tt to 0 as r incu’cases in crossing 
the ct = r half of the light cone, and changes discontinuously from — tt to 
0 as r increases in crossing the ct = —r half of the light cone. Now the 
derivative of an increasing step function is a positive 3 function, so that 
we obtain from (47.30) : 

Z)(r,0 ^ (47iTc)-i[5(r - ct) - + ct)i ct ^ ±r (47.34) 

It is interesting to verify that the right side of (47.27), with the expres- 
sion (47.34) for D near the light cone, reduces to 5,^5 (r) when ^ == 0; this 
shows that (47.27) agrees with the third of Eqs. (47.8) in the proper limit, 
as of course it must (see also Prob. 14). To see this, wo note that the 
terms {cctji • grad + ick^Pji)D{td) vanish when / = 0, since the two 5 
functions in (47.34) cancel. The term (6/0/)D(r,/), hoAvever, becomes 
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— (27rr)”^5'(r) when t = 0. We now show that this quantity is equiva- 
lent to fi(r). Let /(r) be an arbitrary continuous function that has a 
continuous gradient at r = 0; then J/(r)5(r)dr =/(0). We also have 
that 

■“ j /(r)(27r?*)“i5'(r)dr “ f{r){2irr)-'^8^{r)Arr^dr 

= - Kr)2T5'{r)dr 
= — j“^f(r)rd'ir)dr (47.36) 

Since the fourth of Eqs. (11.13) states that r5'(r) can be replaced by 

— 5(r), we see that (47.35) is equal to/(0). 

Problems 

. 1. Make use of Kqs. (45.12), (45.13), and (46.14) to show that the classical 
Hamiltonian equations of motion for a field agree with the Lagrongion equations in the 
cell approximation, 

2. Show that = [F^H] oven if the functional density F depends on grad ^ and 
grad TT as well as on ^ and t. 

3. The wave amplitude ^(ro,0 can be regarded as a functional to which corre- 
sponds the functional density ^(r,05(r — ro), and similarly for Show that Eq, 
(46.20) gives the corr(M*-t ociuations of motion for \f/ and t when uso is made of these 
functional densities. Also u.s(‘ them to cabuilatc the Poisson-bracket expression for ^ 
and TT at different points and the same time. 

4 . Show in detail that the (Commutator of N and the kinetic-energy tenn 
grad • grad ^ in (40.10) is zero, and that the commutator bratikots in (46.6) are 
constant in time. 

6, Make uso of Eqs. (40.14) and (40.16), and the answer to Prob. 1, Chap, VI, 
to show that tho matrix for dk dtdiiu'd by (40,11) (cannot be diagonalized. 

6. Extend (40.11) and the following (equations to the situation in which the Vh 
constitute a continuous s(*t of functions. Then make uso of Prob, 2, Chap. Ill, to 
show that tho nuMisun'd number of pai'ticl(‘,s in the infinitesimal nciighborhood of any 
point is a positive inU*.g('.r or zc'ro. 

7. Show that I'lcjs. (40.21) an^ (u>inpl<4/ely lupiivaleiit to Ikjs. (46.20) if tho^^s 
are orthonormal. 

8 . Show that the iiaticommiitator bracket lias diffevemt algebraic properties from 
tho commutator bracked, and tlu^ Poisson l>ra(ket. 

9. It is shown in tlu' U^xt tliat an ehvtroii field amplitude is not chisaically meas- 
urable, since el('(*trons obey tlie exclusion principh^ and have elec’.tric eliargo. Explain 
how it is that the dilTraction ]iattern of electrons sc.atteix^d by a crystal, which repre- 
sents interforenc-e of (dec.tron amiilitudes, (*.an be measured chifwic-ally. 

10. Show that N is a constant of the motion for iionrolativistic Permi-Dirnc 
particles. 

11. Consider a systciin of Pernii- Dirac particles for which there are two states. 
Obtain explicit matric,(m for the a’s and 'p's that are analogous to (46.20) and (46.30). 

12. Show that tlui unquantiz(Kl Hamiltonian equations for tho Dirac theory agree 
with the corresponding Lagrangian equations. 

13. Show that N given by (47.10) commutes with the Dirac field Hamiltonian 
(47.6). 
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14 . Show that the anticommutation relation (47.27) for ^ and at different 
spaoe-time points reduces to the third of Eqs. (47.8) when the times are made the 
same. Make use of the replacement of the k sum in (47.28) by the k integral in 
(47.29), and a representation of the 5 function given in Sec. 11. 

16 . Show that [p(r,0, p(r',i)] = 0 for both quantizations of the nonrelativistio 
SchjSdinger equation, and for the quantized Dirac theory. 

16 . Show that if the wave functionals satisfy SchrOdinger-type equations ihi 
« =*> ^*jEr, then the calculation of the time derivative of a general 

matrix element of F results in Eq. (45.23) [compare with the transition from (23.1) 
to (23.2)]. Show also that if the set of "^^^s diagonalizes H as well as iV, each ^ oscil- 
lates in time with a frequency determined by the corresponding eigenvalue of H. 



CHAPTER XIV 

QUANTUM ELECTRODYNAMICS 


The field quantization technique developed in the last chapter will 
now be applied to the electromagnetic field. In this case, the classical 
wave theory contains no suggestion of photons, while the quantized 
theory successfully explains the wave-photon duality discussed in Chap. 
I. It is convenient to consider first the electromagnetic field in empty 
space (Sec. 48), and then to take up the interaction between electrons and 
the field (Sec. 49). Some problems that illustrate the theory are solved 
in Sec. 60.^ Since we are not concerned with establishing the Lorentz 
invariance of the theory, we use three-dimensional rather than four- 
dimensional notation throughout. 


48. ELECTROMAGNETIC FIELD IN VACITUM 
We shall follow the methods developed in Sec. 45. The equations of 
motion of the electromagnetic field are Maxwell^s equations, and we 
start by finding a Lagrangian whose variation yields these equations- 
From this, canonical momenta can be defined, and a Hamiltonian can be 
set up. Quantization is effected by replacing the classical Poisson 
brackets by commutator brackets. We shall not consider the possible 
existence of anticommutation relations between the field variables, since 
experiment shows that strong electric and magnetic fields are classically 
measurable and that photons obey Einstein-Bose statistics. 

Lagrangian Equations. Maxwells equations in empty space are 
obtained by setting p and J equal to zero in Eqs. (35.2): 


1 E L ^ ^ , ij 1 5E ^ 

curl E 57 =0, curl H ^ 

C dt ^ C dt 

div E = 0, div H = 0 


(48.1) 


The Lagrangian i>s most conveniently expressed in terms of the potentials 
A,0 that are partially defined by 


E = - - grad H = curl A (48.2) 

C UL 


^ For further diacuasion, aee the rcfcrencoa cited in footnote 1, page 329, and also 
E, Fermi, Rev. Mod. Pkys., 4, 87 (1932); L. Ttoaenfeld, Ann. Inal. Henri Poincarij 1, 
26 (1931); W. Heitlcr, "The Quantum Theory of Radiation/' 2d ed. (Oxford, New 
York, 1944). 
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As discussed in Sec. 35, this does not specify the potentials completely, 
since gauge transformations of the potentials can still be made without 
altering the electric and magnetic field strengths computed from (48,2). 
The Lagrangian density can be taken to be 

The Lagrangian equations are obtained from (45.8) if we regard A*, 

A a, and <i> as the field variables. Variation of the components of A gives 
three equations that can be written together as 

- S curl A - i (i ^ + g,.d ♦) - 0 

This is the same as the second of Eqs. (48.1). Variation of gives 

which is the same as the third of Eqs. (48.1). The definitions (48.2) 
for the potentials automatically satisfy the other two of Maxwell’s 
equations. 

Hamiltonian Equations. The momentum canonically conjugate to 
Aa is found from (45.15) and (48.3) to be 

with similar expressions for the other two momenta. The momentum 
canonically conjugate to <!> vanishes identically, since ^ does not appear 
in the Lagrangian density. A similar situation was encountered with rj/ 
in the nonrelativistic SchrSdinger equation (Sec. 46) and in the Dirac 
equation (Sec. 47) ; as before, it means that (j) cannot be treated as a field 
variable and must be eliminated from the Hamiltonian.^ 

The Hamiltonian density is obtained from (45.24) : 

dA 1 

H = V - L = 27rc2p2 + — (curl A)^ - cP • grad c/> (48.5) 

where use has been made of (48.4) to replace dA/dt by terms involving P. 
The Hamiltonian equations of motion (45.19) are 

^ = 4irc^P - c grad 4>, ^ ^ curl curl A (48.0) 

‘ For an alternative approach, see P. A. M. Dirac, V. Fock, and B, Podolsky, /■'/ly/s. 
ZeitB, S<ywjeiiinimj 2 , 468 ( 1932 ). 
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The first of these equations is the same as (48.4) ; it is necessary that (48.4) 
be obtained over again in this way, since the Hamiltonian formalism con- 
sists only of (48.6) and the canonical variables A and P* We can now 
make use of this equation to define a quantity E that is equal to — 47 rcP. 
Then the second of Eqs. (48.6) agrees with the second of Maxwell's 
equations (48.1), if we also define a quantity H to be equal to cxirl A. The 
first and fourth of Eqs. (48.1) are satisfied because of the way in which 
E and H are defined. 

The third of Maxwell’s equations cannot be obtained as a Hamilton- 
ian equation based on (48.5). We can, however, say that we shall be 
concerned only with those solutions of the Hamiltonian equations for 
which div E = 0, or div P = 0, at some definite time. If then we can 
show that this restriction is maintained at all times, the solutions so 
chosen form a consistent and satisfactory set. The time derivative of 
div P is found from the second of Eqs. (48.6) to be 

d 1 

— divP == — — div curl curl A = 0 

ol 4 t 

Since the field equations are of first order in the time derivatives, we have 
shown that the restri(^tioii that div E = 0 at one instant of time is equiva- 
lent to the validity of the third of Eqs. (48.1) at all times. 

We now see that the last term in the Hamiltonian density (48.5) con- 
tributes nothing to the field Hamiltonian. Its volume integral con be 
transformed by means of a partial integration into c J <#) div Pdr, which is 
equal to zero; the surface term vanishes because P either vanishes suf- 
ficiently rapidly at infinity or obeys peilodic boundary conditions at the 
walls of a large box. The Hamiltonian is then 

7/ = J + 

and 4> has disappoanul, I'liis is in agreement with the usual expression 
(l/87r)J(E’* + H‘'*)dr for the total (uiergy in the electromagnetic field. 

Quantum Equations- Th(^ (fiassical electromagnetic field is converted 
into a quantum lic'ld in the following way. We start with the Hamil- 
tonian (48.7) and the eanonie.al (i(4(l variables A,P. Since <#> no longer 
appears, it is couv(^uieut to elioosc^ th(^ gauge so that (/i = 0. The general 
equation of motion is (46.23), and l.he commutation relations (45.26) 
between the field variables be(a)mc 

[A«(r,/,), = [P«(r,/), P/(rS/)] = 0 

[i4«(r,/.), 7V(r',0] = - r') 

Each of the indices s,s' can be y, or z. 



(48.8) 
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The equation of motion for a typical component of A is 
ihAt(t,t) = [A,(r,0, H] 

Ate commutes with the (curl A)* term in i7, and also with that part of the 
term that involves PJ + PJ; thus we need calculate only the commutar 
tor of Aa and the PJ term in H, This is 2tc^ times the integral over r' of 
[AxjP'J], where the prime indicates that the argument is r' rather than r. 

[AairA PJ(r',i)] = 

= [P'A. + - r')]PJ - P'a^Aa 

= P'JP'aA. + ih8{t - r')] + ihd{x - t')Pi - P^^A. 

= 2ih8{i — r')P«(r',i) 

Integration over r' gives the relation 

ihAa{t^t) = 2irc^j2ih5{t — r')Pa(i'jt)dT' 

= AircHhPaiht) ( 48 . 9 ) 

This is the same as the corresponding classical equation, which is the first 
of Eqs. (48.6), when <#> = 0. 

The equation of motion for a typical component of P is 
iAP*(r,i) = [PAht\H] 

Pa commutes with all the integrand of H except for that part which con- 
tains the sum of the squares of the y and z components of curl A. The 
calculation of this commutator bracket is straightforward but tedious 
(see Prob. 1), and yields an expression for P that is in agreement with the 
second of Eqs. (48.6). Thus if we define E = — 47rcP and H = curl A, 
the quantum equations of motion for A and P agree with the first, second, 
and fourth of Maxwell's equations (48.1). 

The third of Maxwell's equations must be imposed as a supplementary 
condition, as in the classical case. If we set div P equal to zero at a 
particular time, it is always zero since its time derivative is zero. Equa- 
tion (48.9) then shows that the time derivative of div A is always zero, or 
that div A is a constant in time. It is convenient to restrict the choice of 
gauge further so that div A is zero everywhere at a particular time, in 
which case we see that it is zero at all space-time points. It is apparent, 
however, that the introduction of the supplementary condition is incon- 
sistent with the commutation relations (48.8). For example, the com- 
mutator bracket of A* and div P should be zero, since div P is zero, but 
is computed from (48.8) to be 

[A,(r,<), div' P(r',<)] = ih-^ S(r - r') 
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It is not surprising that this inconsistency should arise, since Eqs. (48.8) 
imply that there are three independent pairs of canonical variables, 
whereas the restrictions div P = 0 and div A = 0 cause only two of these 
pairs to be linearly independent. We should therefore modify the com- 
mutation relations so that they are consistent with the supplementary 
condition. 

The nature of this modification is established in Prob. 2. It turns out 
that the commutator brackets of A(r,0 and P(r',0 do not vanish when 
r — r' is finite. This would appear at first to contradict the physical 
principle that there can be no interference between measurements per- 
formed at different places and the same time (see the discussion of Sec. 
47). However, the vector potential A is not in itself a physical quantity; 
only the electric and magnetic fields are directly measurable. We shall 
now show with the help of (48.8) that the commutation relations of E and 
H have the required infinitesimal character and are, moreover, consistent 
with the supplementary condition div E = 0. It can also be shown (see 
Prob. 3) that the same results are obtained by starting with the modified 
canonical commutation relations of Prob. 2. 

Commutation Relations for E and H- The electric and magnetic fields 
are defined by the equations 

E = — 47rcP, H = curl A (48.10) 

where the commutation relations for A and P are assumed to have the 
form (48.8). We see at once that 

[E«(r,/), E-Kr',0] = [Ha(r,/0, = 0 (48.11) 

where each of the indie, es s,s' (uin be :r, y, or z. The commutator bracket 
for typical parallel e.omponents of E and H is 

[E,(r,/.), H*(r',<)] = -4 tc [p., = 0 (48.12) 

Por typical pcrp(^ndi(;ular components of E and we obtain 

[E.(r,t), H„(r',f)] = -4xc [p., 

= 4xcift^,5(r-r') (48.13) 

oz 

other relations similar to (48.13) ai’o obtained, by cyclic permutation of 
•• 5 , 1 /, 2 . 
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It follows at once from (48.11) that div E commutes with all com- 
ponents of E. The commutator bracket of div E and a typical component 
of H is, with the help of (48.13), 

[div E. Ha - [f , Hi] + [f , Hi] 

Since {d/dy^)h{T' — r) = —{d/dy)8(f — r), we see that the right side of 
(48.14) is zero. Thus div E commutes with E and H, and hence also with 
the Hamiltonian, which from (48.7) can be written as 



+ H^)dr 


(48.16) 


This means that div E is a constant of the motion, and so is zero at all 
space-time points if it is made to vanish everywhere at a particular time. 

As would be expected, the field commutation relations (48.11), (48.12), 
and (48.13), together with the Hamiltonian (48.15), can be used in place 
of the canonical formalism originally developed in terms of A and P. We 
have already seen that div E is a constant of the motion; a similar calcula- 
tion shows that div H is constant, so that it can also be made to vanish at 
all space-time points. The first two of Maxwells equations (48.1) 
then follow as special cases of the general equation of motion (45.23) 
(see Prob. 6) : 


ihL = [E., H]=^j [Ex, (Hi» + 

= iAc(CUrl H)a 

iARx = [Hx, [Hx, (E;* + E;=*)]rfr' 

= — ^fc(curl E)x 


(48.16) 


Plane Wave Representation. For many applications, a representation 
of the potentials and fields in a complete orthonormal set of plane waves 
is useful, These plane waves are taken to be vector functions of r that 
are polarized perpendicular to the propagation vector so that the condi- 
tions div A = div P = 0 are satisfied. 


tikx(r) = Z/-®ekx exp ^k • r, X = 1,2 

The vectors k are chosen as in (11.3), so that the Ukx satisfy periodic 
boundary conditions at the walls of a large cubical box of volume L®. The 
Ekx are unit vectors, and eki, ^ 2 , and k form a right-handed set, so that 
^ ' fikx = 0 and div Ukx = 0. It is easily verified that the orthonormality 
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property assumes the form 

j UkX • Uk'X'<fT = 6kk'5xX' 

We expand A and P in terms of the Ukx: 

A(r,«) = y' [gkx(<)iikx(r) + gkx*(Oiikx(r)] 

(48.17) 

[pkxCO'lkxW + PkX*(0^kx(r)] 

The operators ^kx* and pkx* are the Hermitian ad joints of 0 ex and pkx, 
respectively; thus A and P are Hermitian. The primes indicate that the 
summations extend over half the k space, so that the plane waves fikx do 
not duplicate u^kx- 

We take for the commutation relations between the g's and p's 

[Qk\iO} ?>k'x'*(0] = [(Zkx*(0j PkV(0] = iA5kk'5x'x • (48.18) 

with all other pairs commuting, and verify that they give the correct com;- 
mutation relations between A and P. It is apparent that 

[vls(r,/), AAr\t)] = [i"a(r,0, PAr\t)] = 0 

We also obtain from (48.17) and (48.18) 

[Aa(r,i), 2^a^(r^,/.)] ” ^ ^ {[(/kx(0» Pk'x*(0]^kx,fl(r)'Wk'x^a'(r^) 

+ [fykx*(0, Pk'v(0]^kx.«(r)i/k'v.8<r0} 

= y €kx,«€kx.i,'[cxp ik ■ (r - r')] (48.19) 

kX 

The subscripts s,.s' denotes c.ar'tcsian components of the vectors on which 
they appear; the prime has been removed from the last summation in 
(48.19) since the pi-imed summation over terms with both k and — k is * 
equivalent to a summal.ioii of k t(U'ms over the entire k space. 

If there wore thrcM^ mutihally perpendituilar unit vectors Ekx, then the 
three numbers ekx,;, would Ix^ tlu^ dirc^c'tion cosines of the cartesian direc- 
tion s, and we would have that ^ ekx.aekx.H^ = Since there are just 

X 

two unit vectors ckx that are perpendicular to each other and to k> we can 
write 

CkX,«€kX.«' = 
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We also have that 

hk^exp ik • (r - r')] = ^ ^ [exp ik • (r - r')] 

With these substitutions, and the replacement of Y by ( 27 r)“*/dTA 

when L is large, we can rewrite (48.19) as 
[il,(r,^), Pa'(ri,0] = -iA5„^{(27r)-®/[exp zk • (r - rOldr*} 

The first brace expression in (48.20) is equal to 8(r — r'). The second 
brace is the Greenes function ffo(rjr') given in (26.12), which according to 
(26.15) is equal to (47r|r •— r'l)“^ The commutator bracket (48.20) 
then becomes 

[A.(r,l), - M^Kt - O - (iT^Tl) (48.21) 

which is that assumed iu Prob. 2; the other commutator brackets vanish. 
This confirms the choice of the commutation relations (48.18). 

Quantized Field Energy. .Substitution of (48.17) into the field 
Hamiltonian (48.7) gives 

^ ^4xc2pfcxpkx* + ^ gkx(?kx*^ (48.22) 

kX 

since and gkx*, and ptx and pkx* commute; here use has been made of 
the restriction on the summation to half the k space, which makes all 
integrals of the form Jukx ■ Uk'x^dr vanish. 

We wish now to find the eigenvalues of H when the commutation rela- 
tions are as given in (48.18). This can be done by choosing linear com- 
binations of the plane wave amplitudes that make H formally equivalent 
to the sum of the energies of a number of harmonic OxScullators (see Sec. 
46) . Now each index pair k,X corresponds to two linearly polarized plane 
waves that travel in opposite senses along the k direction. Thus w^o 
want our new linear combinations of Qkx'Ukx ^'Ud pkxUkx to have the general 
forms 

Gkx exp i{k • r - kct), a'kx* exp i(k • r + kct) (48.23) 

where Okx and a'kx are operators that are independent of r and L The first 
of these is a plane wave that travels in the positive k direction and has 
the positive angular frequency /cc, and the second is the Hermitian adjoint 
of a plane wave that travels in the negative k direction and has the same 
frequency. 
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With the remarks of the last paragraph as a guide, we proceed to find 
the time dependence of gtx and pkx. Their equations of motion are given 
by (46.23) : 

= [ffkXjH] = 47riftc®pkx 

... r m ihk^ (48.24) 

tApkx = [Pkx,«] = - ®cx '• ' 

Elimination of pkx gives a second-order equation for gtx 

gkx = 4irc'^pkx = -fcVgkx 

This is easily integrated to give 

gkx(«) = akkC-**" + a'kxV”* (48.25) 

where we have followed the pattern indicated by (48.23). We then 
obtain from the first of Eqs. (48.24) 


(48.26) 


Equations (48.25) and (48.26) can be solved for the a's. 

^ / , 47rtc \ 

I 7kx H — ^ I 

a'kx* = i (qTL\ - 13 

Similar relations hold for tlieir llermitian adjoints. The commutation 
relations for the a’s can be obtained from (48.27) and (48.18) : 


ftihoi 


ft — lAoi 


(48.27) 


[%X,«*k'X'] = [^i'kXjrt' Vx'] = ^kk'^XX' 


(48.28) 


with all other pairs c.ommutinfi;; these are independent of the time, as 
they should be. 

Substitution of (48.25) and (48.26) into the Hamiltonian (48.22) 
gives 

« = ^ (Okxakx* + a'kx*o'kx) (48.29) 

kX 

If we adopt the definitions 

A^kx = 2 " ^^ " ^■kx*(Jkx, A^^kx = (48.30) 

we see from the work of See. 40 that Nk\ and A^'kx each have the eigen- 
values 0,1,2, .... In terms of the AT's, the Hamiltonian (48.29) 
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becomes 

H = Y + A^'kx + 1) (48.31) 

kX 

Because of the structure of (48.23) and (48.25), we can identify a'tx with 
a-kx, and AT'kx with iV-kx. Then the restriction on the summation of 
(48.31) to half the k space can be removed to give 

H = y Uk{Ni^ + i) (48.32) 

kX 


Equation (48.32) is equivalent to Planck's quantum hypothesis: the 
energy associated with each plane electromagnetic wave is an integer 
multiple of the fundamental quantum hv = hkc. In addition to the 
Planck energy, however, there is the harmonic-oscillator zero-point 
energy of one-half quantum per state of the field, which is infinite since 
there are an infinite number of states. This infinite energy is not objec- 
tionable since it does not interact with charged matter.^ 

Quantized Field Momentum. The momentum density of an elec- 
tromagnetic field is the Poynting vector (c/47r)E(r,0 X H(r,i) divided by 
c^. The total momentum in the field can then be Avritten in terms of the 
canonical variables, with the help of (48.10), 


G = ^/ EXHdr J VX (curl A)dT 

Substitution from (48.17), (48.25), (48.26), and (48.30) gives 

G = t ^ k(pkx5kx* - Pkx Vx) 

= ^ V k7c[(akxakx* + akx*akx) - (a'kxa'kx* + a'kx*a'kx)] 

kX 

= ^ + i) - (X^'kx + ^)] = y 

kX kx 


(48.33) 


where the restriction on the summation is removed as in the transition 
from (48.31) to (48.32). In this case, the zero-point terms cancel for 
plane waves that travel in opposite directions. 

Equations (48.32) and (48.33) show that the energy and momentum 
of each plane wave are quantized in units of hkc for the energy and hk for 
the momentum. It Avill also be shown explicitly in Sec. 50 tliat the intcr- 


^ See also the discussion following Eq. (50.17), 
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action between matter and radiation is such as to account both for the 
interference of light and for the discrete properties of light quanta. 

A(r,i) in the Plane Wave Representation. The vector potential 
appears in the interaction between electrons and the electromagnetic 
field discussed in the next two sections. We shall therefore require an 
expression for A(r,^) in the plane wave representation that is specified by 
the eigenvalues nkx of the operators ATkx. A typical wave functional for 
this representation can be written as ^(- • • rikx • • Oj which describes a 
state of the electromagnetic field in which there are nkx light quanta with 
momentum hk and polarization ckx* We then see from Eq. (48.28) and 
the work of Sec. 46 that the operators akx and akx* have the properties 


. . ntx • • •) = . . nkx - 1 • • •) 

• • ntx • • •) = ^(. • • n^x + 1 


(48.34) 

• •) 


We obtain from (48.17) and (48.25) 


A(r,f) = L- « tkx[(«kx«“*‘'‘ + a'kx*e''*‘‘)(exp it • r) 

kX 

+ + a'kxe”^*‘'0 exp(— ik ■ r)] 

= ^ ^ £kx{^-kx[oxp ^‘(k * r - kct)] 

kx 

+ a-kx* exp[— i(k ' r — kct)]] (48.35) 


Here again the rc^st-riction has been removed from the summation by 
identifying a'kx with a.„kx. This expression for the vector potential is 
easily seen to Ix^ llermitian, as it must be. It follows from the stmeture 
of (48.34) that akx and akx*aro destnuition and creation operators, respec- 
tively, for a light ciuantiim in th(5 state k,X. Thus a term in the Hamil- 
tonian linear in A would give rise to the emission and absorption of light 
quanta. 

Commutation Relations at Different Times. It is interesting to 
generalize the commutation relations (48.11), (48.12), and (48.13) for 
the components of E and H, to the case in which the times are different.^ 
As with the quantizcul Dirac equation (Sec. 47), the result shows under 
what circumstaiu'-es measurements of the electromagnetics fields at dif- 
ferent space-time points alTect ea(^h other. 

Expansions for E and H in terms of the akx, that are analogous to 
(48.36), can be found without difficulty: 

' These commutation relations are due to P. Jordan and W. Pauli, Zeiia. f. Phyaik^ 
47 , 161 ( 1928 ). 
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E(r,i) = L“8 ^ifcek\{akx[exp ^■(k * r — hct)] 

— flkx* exp[— i(k * r — /bci)]} (48.36) 

H(r,i) = y i(k X 8kx){akx[exp i(k • r — kct)] 

kX 

— akx* exp[— i(k • r — /bc^)]} 

The commutator bracket for two cartesian components of the electric 
field strength is 

[EsCr,^), = i"“® Y 47rtftcfc€kx,fl€kx,«' sin (k • ^ — kcr) (48.37) 

9 = r - r', r = t - f 

where use has been made of (48.28). The summation over the polariza- 
tion index X can be evaluated by the technique used on the similar sum- 
mation in (48.19): 



€kx.fl€kx. 8 ' sin (k • 9 — kcr) = ^ — hh') sin (k ■ 9 — kcr) 


The summation over k can be evaluated by replacing it with an integral 
for large L: 


jL“® X sin (k • 9 — kcr) ( 27 r)-® J sin (k • 9 — kor)drk 

k 

= ( 27 r)“®J (2ifc)“i[exp (tk * 9 — ikcr) 

— exp (— ik * 9 + ikcT)]drh 
= ( 27 r)~®J ( 2 ^/c)'“i[exp (tk • 9 — ikcr) 

— exp (^k • 9 + ikcT)]dTk 
= — ( 27 r)‘“®Jfc”^(exp (tk • 9) sin kcr dru 


We denote the last expression by — cDo( 9 ,r), where Do is the same as the 
D function of Eq, (47.29) except that fco is set equal to zero. 

We can see from (47.30) and (47.33) that Do(9,r) is obtained as the 
result of operating with — (47rV)”K^/^p) on a function that is equal to 
rfTT for CT > p, 0 for p > CT > — p, and — tt for — p > cr. Thus Do has 
the form (47.34) for all p, not just for p in the infinitesimal neighborhood 
of |crl : 

-Do( 9»7-) = (47rpc)“^[5(p — cr) - 5(p + cr)], p = |r - r'| (48.39) 
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It then follows from (48.37) and (48.38) that 

= -4«Ac* I - A ^ j i>„(r - r', f - «') (48.40) 

The commutator bracket for two cartesian components of the magnetic 
field strength can be found in the same way, and is 

[H.(r,0, = [E.(r,0, E.<(r',0] (48.41) 

An expression for the commutation relation between components of 
E and H can also be obtained from (48.3()). 

[E,(r,0, ^ 

= Ij~^ ^ 47 rtA 6 ‘A'(eki,„ 6 k. 2 ,„^ — €k2,,6ici,g/) sin (k • 9 — kcr) (48.42) 

k 

It is apparent that (48.42) vanishes if s = s\ so that parallel components 
of the electric and magnetic field strengths commute at all space-time 
points. If s 7 *^ s', we can put 6 ’ = s' = y, where x^y^z form a righW 
handed set of axes, in which case we see that 

Cki.x^ks.v Cks.xCki,!/ ~ (ckl X £k2)* — 

An analysis similar t,o that whi(0\ 1 (m 1 from (48.37) to (48.40) then shows 
that 

[Ex(r,<), H„(r',t')l = ■\Tifw. - IMt t') (48.43) 

The subscripts x,y,z (^an bo permuted cyclically in (48.43). 

Since all these commutation relations involve the function (48.39), 
^ve aee that all (components of the field strengths (jommute except in 
the infinitesimal neighborhood of the light cone — i{') = ±|r — r'|. 
Thus the field strengl.hs at spac.(vtiino points so situated that a light sig- 
nal cannot pass from either one to (Jic other, commute Avith each other 
and can both ])e measun^d ])rec.isely. This shows that the quantized 
electromagnel.ic fic'.ld is propagated with the cliiHsuuil speed of light c. 
A discussion of the eonmud.ion Ixitwc^en these commutation relations and 
the uncertainty principle luis Ix^cn given by Bohr and Rosenfeld.^ 

49. INTERACTION BETWEEN ELECTRONS AND THE 
ELECTROMAGNETIC FIELD 

The quantization of Dirac’s rtdiil ivisilc wave equation for free elec- 
trons Avas carried through in Bc^c. 47. We sliall now combine this fre(.>- 

' N. Bohr and L. Rowcinfold, Det Kgl. Danske Vidensk. SHskah^ Mat.-Jys, Mcdd., 12, 
8 (1933). 
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electron theory with the quantum theory of the electromagnetic field in 
vacuum, given in Sec. 48. The resulting formalism is called quantum 
electrodynamics; it provides a description of the interaction between elec- 
trons and the electromagnetic field. 

TjigrflngiflTi and Hamiltonian Equations. We wish first to obtain a 
Lagrangian whose variation yields the earlier equations of motion for the 
electron field and the electromagnetic field. Dirac’s electron field equa- 
tion is (43.22); 

ih^ — e^ -|- a • (— iftcgradV' — eA^) - 1 - = 0 (49.1) 

at 


where e is the charge on the electron, and hence is a negative number. 
Maxwell's equations for the electromagnetic field are given in (35.2) : 




III 1 dE 47r _ 

curl H — ^ — J 

C dt C 


div E = 4^p, div H = 0 


(49.2) 


The electric charge and current densities that go into (49.2) can be 
obtained from equations (43.20): 


p = J = —ec^^fsiyp (49.3) 

We assume for the present that p and J are due entirely to electrons. We 
can verify the electric continuity or conservation equation (35.3) by 
multiplying (49.1) on the left by its Hermitian adjoint equation on 
the right by and taking the difference of the two results. 

The desired Lagrangian is just the sum of the Lagrangians for the 
separate electron and electromagnetic fields, with the electron operators 
ih(d/dt) and —ihc grad replaced by ih(d/dt) — e<j) and —ihc grad — cA, 
respectively. We thus obtain from (47.1) and (48.3) 


L = y ^ ‘ { — grad — cA)\l/ + rnc-jSi/'j dr 

+ a / [0 ^ *)' - 

It can be shown that variation of xp* in (49.4) leads to (40.1), variation of 
\p leads to the Hermitian adjoint of (49.1), and variation of A leads to 
Eqs. (49.2) when use is made of (49.3) and (48.2) (see Prob. 9). 

The Lagrangian (49.4) suffers from the defects noted earlier in con- 
nection with (47.1) and (48.3). Since and (p do not appear in (49.4), 
their canonical momenta cannot be defined, and so \p* and <p must be 
eliminated as coordinate variables from the Hamiltonian theory. As 
before, the momentum canonically conjugate to a component xpf of xp is 
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and the momentum canonically conjugate to a component Aa of 
A is P» = (47rc)'"n(Vc)(^^»/^0 + The Hamiltonian then 

becomes 


= J • (i^cgrad + eA)4f + e<l>4' — 7nc^p\l/]dr 

+ I |^27rc2p2 + ^ (curl A)® - cP • grad dr (49.6) 


and appears as a canonical momentum variable. 

It is not difficult to show that the Hamiltonian equations of motion 
for V' and its canonical momentum are (49.1) and its Hermitian 
adjoint, respectively. The Hamiltonian equations for A and P are 


^ = 47rc2P - cgrad<;> 
ot 

ap 1 

— = — curl curl A — 


(49.6) 


Thus if we define E = — 47rcP and H = curl A, as before, we obtain the 
first, second, and fourth of Maxwell’s eciuations (49.2). 

Elimination of 0. The third of Maxwell's equations (49.2) must be 
obtained as a supplomoiitary condition, as the corresponding equation was 
in Sec. 48. We siuill thcj'ofore be concerned only with those solutions of 
the Hamiltonian equations for which divE — 47r(?i/'*i^ = 0 at some definite 
time. If then tlio time derivative of this (juantity is zero, tlie restriction 
is maintaincKl at. all times, and the solutions so chosen form a consistent 
and satisfac.tory h( 4.. With the help of the second of Eejs. (49.0) and the 
definition of E, we sc^e that 

^ (div E — = Ittc I c (liv(^*«^) — ^ 

dl L - 

This is zero hocwiuso of (49.3) and th(5 (upiatioii of continuity for the elec- 
tric charge and (uirrcnt (h^nsitics. 

We (am now se(^ that th(i two tei-ins on the right sid(^ of (49.5) that 
involv(i (j) (ain(a4 ea(;h otlmr. The secanul 0 t<M-in can 1)0 integrated by 
parts to give 

— cjP • grad <l) dr = c[<t) div P dr = — 

which is e(pial to and opposite in sign from the first <t> iovm cj(j)yp'^\l/dT. 
Thus (j) has disappeared from the Hamiltonian, and may be chosen in any 
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convenient way. The choice is made so that when P (or E) is divided into 
solenoidal and irrotaiional parts, the latter is expressed entirely in terms 
of 0. We put 

P = Pi + P 2 

div Pi = 0 (Pi is solenoidal) (49.7) 

curl P 2 = 0 (P 2 is irrotational) 

If now we put P 2 = (47rc)"^ grad <l>, we see that the third of Eqs. (49.7) 
is satisfied, and that the first of Eqs. (49.6) becomes 

^ = 47rc^Pi (49.8) 

It is now possible to have div A = 0 at all times if it is true at any one 
time, since (49.8) and the second of Eqs. (49.7) show that (d/dt) div A 
= 0. We therefore choose the gauge so that div A = 0. 

The 0 potential reappears in the Hamiltonian (49.6) through the P‘^ 
term. The volume integral of P^ can be written, with the help of the 
expression for P 2 and a partial integration, 

jP^dr = JVldr + ;(2Pi + P 2 ) • P^dr 

= J PJdT + ^ J (2Pi + P 2 ) ■ grad «dT 

= J Tldr - j 4> div (2Pi + I>,)dT 

Now div Pi = 0, and the supplementary condition is that div P 2 = —pic; 
thus the P^ term in H becomes 

27rc^jP‘‘*dr = 2TC^jPidr + -J-J <i>pdr (49.9) 

The choice of 0 tells us that 

= 4tc div P 2 = — 47rp 

This equation can be integrated by making use of the Greenes functi(jn 
(26.15) with k = the result is 

= y dr' (49.10) 

The Hamiltonian (49.5) can now be rewritten, with the help of (49.9) 
and (49.10), 

H = J \l/*[a • (ihc grad -h eA)0 — 

+ y [2^c»P? + ^ (curl A)»jdr + i J J drdr' (49.11) 



Sma. 49] 


QUANTUM ELECTRODYNAMICS 


377 


Here, div Pi = div A = 0, and p(r,i) = e^*(r,i)^(r,i). The last term in 
(49.11) is the internal Coulomb energy of the electric charge distribution 
p(r,Q ; it results from the elimination of and the irrotational part of P, 
and does not have to be inserted into the theory as a separate assumption. 

The solenoidal field vectors (Pi and A) are usually referred to as the 
transverse part of the electromagnetic field, since the electric and mag- 
netic field strengths in the corresponding plane wave solutions are trans- 
verse to the direction of propagation, as in Sec. 48. The irrotational 
Coulomb field vector (P 2 ) is called the longitudinal part of the field, since 
Eq. (49.10) shows that the contribution to P 2 at one point from an 
infinitesimal element of charge at another point is along the line joining 
the two points. 

Quantization of the Fields. We can now obtain a quantum theory 
of the interacting electron and electromagnetic fields by adopting the 
equation of motion (45.23), the electron field anticommutation relations 
(47.8), and the electromagnetic field commutation relations (48.21) with 
Pi substituted for P. It will also be assximed that all components of 
and commute with all components of A and Pi. 

The order of factors like and \l/i that are multiplied together in the 
Hamiltonian (49.11) is of course immaterial in the unquantized case. In 
the quantum theory, however, these factors do not commute with each 
other, and the result of a particular calculation will depend on the order 
in which they appear in the Hamiltonian. We shall see at the end of this 
section that a suitable Hamiltonian is obtained if all terms are left as 
they are, except for a change in the integrand of the Coulomb term. This 
integrand in (49.11) contains the term 

4 4 

p(r,/.)p(r',0 = y y (49,12) 

I L 

which we replace by' 

4 4 

y y (49.13) 

It can be shown with the lu^lp of tlio anticommutation relations (47.8) 
that (49.12) is equal to (49.13) plus 

4 

^ - r') 

1 


^ Note that both (49.12) and (49.13) arc Hcnnitian. 
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Thus the change from (49.12) to (49.13) is equivalent to the subtraction 
from the Hamiltonian (49.11) of the quantity 


2 ; j 


^ drdr' 


(49.14) 


This is evidently infinite unless is zero; it will be shown below 

that this can happen only if there are no electrons in the field. 

The quantum equations of motion are obtained from (45.23), when 
the Hamiltonian (49.11) is modified by substitution of (49.13) for (49.12). 
The electromagnetic field equations agree with those obtained in Sec. 
48, except that Pi replaces P and the electric-current term appears as in 
the second of Eqs. (49.6). The electron field equation is the same as 
(49.1), except that (f) is replaced by 


It can then be shown that the time derivatives of the bracket expressions 
in (47.8) and (48.21) are zero, so that these equations hold for all time if 
they are valid initially, as was assumed (see Prob. 11). 

Inclusion of Static Fields. Thus far we have assumed that the 
electric charge and current densities arise entirely from the electrons that 
are described by the Dirac \l/ field. The effect of a static charge distribu- 
tion can easily be included, by the addition of a term 47rp« to the right 
side of the third of Eqs. (49.2), and a term —e<l>^\l/ to the left side of (49.1), 
where = — 47rpfl. It is not diflScult to see that the only effect on 
the Hamiltonian (49.11) is to add a term 

The situation of greatest practical interest is that for which^ 


(l>a = — -Ze/r. 

This corresponds to a fixed (infinitely massive) point nucleus of atomic 
number Z, placed at the origin. With this addition, and the modifica- 
tion (49.13), the Hamiltonian (49.11) becomes 

H — y* • (ihc grad H- cA)^ — ^ yp — 

+ j 1 (curl Ay^dr + ^ j J ^ drdr' (49.15) 

Jl 

where the primes indicate that the arguments are r' rather than r. 

Use of Perturbation Theory. It is natural to attempt to find the 
eigenvalues of the Hamiltonian (49.15), which would be the energy levels 
of the system of electrons, electromagnetic field, and nuclear Coulomb 
* e ia the electronic charge, and hence negative. 
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field. All such attempts have failed, and there is reason to believe that 
these eigenvalues do not exist: that it is impossible to diagonalize this 
Hamiltonian. This reason derives from a perturbation approach that is 
based on the smallness of e. If e is set equal to zero, (49.15) becomes 
simply the sum of the free-electron Hamiltonian (47.6) and the vacuum 
electromagnetic-field Hamiltonian (48.7). These Hamiltonians have 
already been diagonalized; the eigenvalues correspond to solutions in 
which there are definite numbers of free electrons and of light quanta, 
with no interaction between any of them. 

For a finite value of c, neither the nuclear term of order Ze®, nor the 
interelec.tron Coulomb interaction of order causes a fundamental 
difficulty. As we show below, the latter term would give an infinite 
electrostatic or longitudinal self-enei'gy (just as in the classical theory of 
point charges), were it not for the somewhat arbitrary substitution of 
(49.13) for (49.12). A more serious trouble arises from the ea ■ A term, 
which couples the electrons to the transverse electromagnetic field. This 
term is responsible for all interactions between electrons and light quanta, 
and will be us(^d in the next section for the calculation of the emission 
and absorption of light by an atom. One of the effects that the ea • A 
term produces is an infinite transverse self-energy of a free electron, due to 
the virtual emission and reabsorption of light quanta by it;^ we shall 
ignore this effect in what follows.” 

We shall work mainly with the perturbation approach. In the 
remainder of this section we consider the matrix elements of the inter- 
electron Coulomb interaction, and ignore the transverse electromagnetic 
field. This example is of interest since it shows how the infinite electro- 
static self-energy is eliminated, and how the exchange interaction between 
electrons (whi<*.h obey Fcu’mi-Dirac statistics and are described by anti- 
symmetric many-partich^ wave funcjtions) comes out of the quEintized 
field theory. The jiorturbation treatment of the ea • A term will be 
taken up in So(*.. 50. 

Matrix Elements of the Coulomb Interaction. We now consider the 
Hamiltonian (49.15) with the ti’ansvcrse electromagnetic field omitted: 


H 




grad \f/ — ■— \l/ — mc^p\l/ 


) 


dr 


+ 




' Soe Hoitlor, op, cil.i 18*2. 

* A rolativiHtir.ally invariant way of subtracting out infinities of this type has 
recently boon clovolopod by H, Tojiionaga, Progrtms of Theor. Physics (Kyoto), 1, 
27 (1946), Phys, 74, 221 (H)1S), and by J. Schwinger, Phys. Rev., 78, 416 (1948). 
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This turns out . to be an adequate approximation when the speed of 
the electrons is small in comparison with the speed of light, since then 
light quanta are not likely to be emitted. In this case, it is possible to 
use the nonrelativistic Schrodinger equation (with spin included) to 
describe the electrons, although we shall continue to use the Dirac 
equation. 

There is a complete orthonormal set of energy eigenfunctions for a 
single electron in a Coulomb field, that we denote by 


j ^ '^i(nyr)wj(n' jT)dT = 5 „ 


?( 


ihca^i • glad 


Ze^ 




(49.17) 


1 = EnWiin^x) 


These can become the free-electron solutions (47.11) when Z is set equal 
to zero. We expand and in terms of the ly's as in (47. 15) : 


^ i»(n,0w,-(n,r) 

n 

(r,<) = ^ b*in,t)Wi(n,T) 


(49.18) 


where , the b’s are operators that obey anticommutation relations like 
■(4ll6): 

[b{n,t), bin',t)]+ = 6*«i)]+ = 0 , 

[bin,t), b*(n',t)]+ = S,.- 

Substitution of (49.18) into (47.10) gives, with the help of the ortho- 
normality relation in (49.17), 

N = f = J b*(n,t)b(n,t) = ^ N„, Nn = b*(n,t)h(n,t) 

n n 

In similar fashion, the first term in the Hamiltonian (49.16) becomes 

^ b*(n,t)b(ri,t)En = ^ (49.20) 

71 n 

In any calculation in which the second term in (49.16) is regarded as a 
perturbation on the first term, we are interested in finding the matrix 
elements of the second term in the representation (specified by the ft^s 


1 This set includes positive- and negative-energy continuum states as well as 
> bound states; it is made discrete by adopting periodic boundary conditions. The 
index n includes specification of the spin. 
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and that diagonalizes the first term. Substitution of (49.18) into 
the Coulomb-interaction term gives 





V *' 


(49.21) 


We wish to find the matrix element of the operator (49.21) that corre- 
sponds to any pair of unperturbed wave functionals (see Sec. 46). Since 
electrons obey the exclusion principle, these wave functionals can be 
specified by stating which of the single-electron states n are occupied. 
Now, the are destruction operators, and the are creation operators. 
Since two of each appear in each term of (49.21), this operator has non- 
vanishing matrix elements only for wave functionals that correspond to 
the same total number of electrons, and for which not more than two 
electrons are assigned to different states in the two wave functionals. 
Moreover, since the two destruction operators appear to the right of the 
two creation operators in each term of (49.21), the only non vanishing 
matrix elements are for wave functionals that correspond to the presence 
of two or more electrons. 

We see then that (49,21) is a null operator so far as a single electron is 
concerned, so that the infinite longitudinal self-energy of single electrons 
has been eliminated. We now show that this elimination was effected 
by the replacement of (49.12) by (49.13). The difference energy (49.14) 
can be written 


^ ^ j y ~ r'l'^wjy(n,r)%(n',r')5(r — rOdrdr' 

nn' j 

(49.22) 

Since b(n^,t) destroys an electron in the state n', and b*'‘(?i,i) creates an 
electron in the state n, the diagonal matrix elements of (49.22) are those 
for which n' = n. Then b*{nyt)b{7i\t) can be replaced by N«5nn', and 
there is a positively infinite contribution to the expectation value of 

(49.22) for each electron that is present. Thus the substitution of (49.13) 
for (49.12) was equivalent to the subtraction of the infinite electrostatic 
self-energy of each electron from the Hamiltonian. 

The expectation value or diagonal matrix element of (49.21) for a 
wave functional that corresponds to two or more elec^trons is a sum of 
terms, each of which involves a pair of oc.oupied electron states. The 
term that refers to the states I and 2, for example, contains the matrix 
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elements of four combinations of 6^s multiplied by appropriate volume 
integrals; these four combinations are abbreviated as b2&i62&i, 6162^261, 
b^bibibi, and 616*^^162. The anticommutation relations (49.19) show 
that the second and third of these are equal to each other and opposite in 
sign to the first and fourth. We can also see from (49.19) that (compare 
with Prob. 12) 

>I^*(1,1, . . .)bJ 6 J 62 fii^(l,l, . . .) 

= ^*(1,1, . . .)iViiV2^(l,l, . . .) = +1 (49.23) 

Equation (49.23) then shows that the part of the expectation value of 
(49.21) that refers to the occupied states 1 and 2 is 

/ f \t- r'|-* y |%(l,r)|* y \wi{2,iTdTdT' 

Y I 

-..j I Ml ,r)tuy(2,r) ^ idi(2^T')wi{ljT')dTdT' (49.24) 

The second integral in (49.24) is called the exchange energy, and also 
appears when the expectation value of the Coulomb interaction is calcu- 
lated for an antisymmetric many-electron wave function like (32.7). 

60. RADIATION THEORY 

The quantum theory of radiation consists of the perturbation treat- 
ment of the ea • A term, and usually also the term, in the Hamiltonian 

(49.15). Calculations have been carried through for several processes of 
physical interest that are of various orders in the electronic charge e.^ 
In most cases it is unnecessary to use the quantized Dirac field that is 
implied by (49.15), since only one electron at a time is involved and the 
theory of Sec. 43 is adequate. 

In this section we consider only the simplest processes of emission and 
absorption of light by an atom, and continue to use the complete quan- 
tized field theory. We assume that the nuclear Coulomb energy and the 
interelectron interaction energy can be combined into an effective atomic 
potential energy V(r) of the Hartree type for the electron under con- 
sideration (see Sec. 38). The resulting formulas agree with those 
obtained in Chap. X by means of a semiclassical treatment. At the end 
of this section, we show how quantum electrodynamics provides a quanti- 
tative explanation of the diffraction experiment discussed in Sec. 2. 
Both the wave-like properties of radiation (appearance of a diffraction 
pattern) and the particle-like properties (ionization of an atom by absorp- 
tion of a light quantum) result from Eq. (49.15) or (50.2). 

^ Details of some of these calculations and references to the original papers are 
given by Heitler, op. cit. 
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Formulation in Terms of Transition Probabilities. The development 
of quantized field theory in the last two chapters has been entirely from 
the point of view of Heisenberg^s form of the equations of motion of the 
field variables (see Sec. 23). This formulation centers attention on the 
dynamical variables rather than on the states of the system. Now, how- 
ever, we wish to calculate transition probabilities between states of the 
system of electrons and electromagnetic field, and in this way obtain 
expressions for the rates of emission and absorption of light quanta by an 
atom. This suggests tliat we make use of the time-dependent perturba- 
tion theory of Sec. 29, which was in fact first developed by Dirac in con- 
nection with the present problem.^ 

The states are represented by wave functionals that can be taken to 
satisfy the Schrodinger-type equations 


ih^ = mr 


(60.1) 


where H is given by (49.15). It can then be shown that the dependence 
on time of the matrices that represent dynamical variables such as ^ and A 
arises from the change in time of the wave functionals 'k that are used to 
calculate the matrix elements (see Prob. IG, Chap. XIII). Thus the 
time dependence is transferred from the dynamical variables to the wave 
functionals; the former will now be regarded as operators whose structures 
are independent of t. 

Before ])r()(! 0 (Mling further, we simplify the Hamiltonian by approxi- 
mately repliKung ih(^ elT(M',t of tlu^ niudcms and all the other electrons on a 
particular clccit.ron by a llai’trcHvtypc potential energy y(r). With this 
change, Eep (49.15) bec-onn^s 


U - //n + //' 

f/o = j • grad ^ f E(r)^ — m(r(S^]dT 

-i- j ^ (curl A)’* dr 

ir = • A^pdr 


(60.2) 


The unporturbed TTjuniltoriian //o can bo rewritten by expanding A in 
plane wav(^s and in eigcuifunctions of Eq. (49.17), where the nuclear 
potential —Zr'lr is replaced by V{r). Wo obtain from (49.20) and 
(48.32) 


^ F. A. M. Dime, Tror. Roi/. Sor., A112, ()()1 Soc. 6; A114, 243 (1927). 
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Fo = 2 

n kX 

iV„ = 6;6„, iVkx = ^^Okx*akx 


(50.3) 


The commutation relations are given by (49.19) and (48.28) : 

[bnM+ = lK,b.>*]+ = 0, [&»,6n'*]+ = 

^irhc 

[akx,^Jk'v] = [a*kN,a\/x'] = 0, [flkXj^iVv] = '-y-' Skk'^xx' 


The a^Q and b*& commute with each other. 

The perturbing energy H' in (50.2) can also be rewritten by making 
use of the expansions (49.18) and (48.36). We can choose t to be an 
arbitrary constant (say i = 0) in these expressions, since the time depend- 
ence of the field variables now arises from the wave functionals. The 
result is 


H' = eL-i y I y 6;%(n,r)ekx • «/* 

nn^X jl 

■ [ufcx exp (^k • r) + akx* exp (— ik • T)]bn'Wi{n\T)dT (50.4) 

We specify the unperturbed wave functionals by giving the quantum 
numbers n of the occupied electron states and the number of light quanta 
TZkx in each plane wave state of the electromagnetic field: 

^ ^ f ^ if the state n is occupied 

[0 if the state n is empty (50.5) 

A^kx^ = rikx^ 

It then follows from (50.1) (with H replaced by i?o) together with (50.3) 
and (50.5), that oscillates in time with a frequency that is determined 
by the sum of the energies of the electrons and light quanta present. If 
we omit the infinite zero-point energy of the electromagnetic field, we see 
that ^ has the time dependence 

exp 1^ — ^ jBn + ^ (50.6) 

n kX 

where the prime denotes a summation over only the occupied electron 
states n. 

We are now in a position to apply the time-dependent perturbation 
theory of Sec. 29. The form of (50.6) shows that only transitions that 
conserve the total energy of electrons and light quanta take place. We 
require the matrix elements of H' in the representation in which Hq is 
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diagonal. These can be obtained by operating with (50.4) on the wave 
functionals ^ in accordance with Eqs. (48.34) for the a’s and equations 
like (46.32) for the 6^s, and making use of the orthonormality property 
expressed by the first of Eqs. (46.20). 

Matrix Elements of the Perturbation. The perturbation W given by 
(60.4) is a sum of terms, each of which contains one 6, one b*, and either 
an a or an a*. Thus the effect of i?' on a wave functional is to destroy 
an electron in one state, create an electron in the same or a different state, 
and either destroy or create a light quantum. This corresponds physi- 
cally to the transition of an electron from one state in the potential V{r) to 
another, accompanied by the absorption or emission of a light quantum. 

We consider first a transition in which a light quantum is absorbed. 
The initial wave functional represents a state of the entire system 

in which there are nkx light quanta in the state k,X, an electron in the state 
n', and perhaps other light quanta and electrons that do not take part 
in the transition. The final wave functional represents a state in 

which there are ntx— 1 light ciuanta in the state k,X, on electron in the 
state n instead of the state n', and whatever other light quanta and elec- 
trons that were present in the initial state. The transition frequency 
0)21 3 0)2 — oji is given by 

Ao)oi = En — En' — hck (60.7) 

The matrix element of 7f' for this transition is^ 




21 



ir)/(n,r)(exp ik • r)ckx 


o:,it«i(n',r)dr 


(50.8) 


We can approximate to (50.8) in the nonrelativistic limit by replacing 
the Dirac single-electron wave functions ty,(7i,r) by Bchrodinger wave 
functions Wn(t), We then see from Eep (43.21) and Prob. 6, Chap. XII, 
that the velocity operator is — ca, and that this can bo replaced by 
(-ift/m)grad in the nonrelativiati(5 limit. The matrix element (50.8) 
then becomes 

^ j w«(r)(exp ik • r)Ekx • grad w„'it)dr (50.9) 

The factor exp ^k • r can bo placed on either sido of the gradient operator, 
since only the component of grad along ckx comes in, and this is per- 
pendicular to k. 


^The sign factor 0n0„'f which ariHCH from Eqs. (46.32), can be ignored in the 
remainder of this section. 
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Transition Probability for Absorption. We shall make use of the 
nonrelativistic approximation in the subsequent development, since the 
results obtained are then directly comparable with those of Chap. X. 
Equation (29.9) gives the amplitude of a particular wave functional at 
the time t. From this and Eq. (50.9) we obtain the total transition 
probability per unit time for absorption of a light quantum in any of the 
states k,X: 


1 V 4|gai|=^ sin^ 
t Z/ toll 


Z/ 7n?(f)L^ 


kX 




(exp ik • r)ckx * grad dr\ 


sin^ 

oJai^ 


(60.10) 


where we have put co = fee. 

We now suppose, as in the work of Sec. 35, that the states n and n' are 
discrete, and that the incident radiation covers a range of angular fre- 
quencies CO in the neighborhood of {En — Enf)/h. The radiation can 
then be described by an intensity I{oi)do3 for the infinitesimal frequency 
range dco. It is convenient to replace the summation over states k,X of 
the radiation field by an integration over co. Each quantum contributes 
an amount ftco/L® to the energy density, or an amount fteco/L® to the 
intensity. We can therefore replace 

X I /it '“''t 

kx 

The time-dependent factor on the right side of (50.10) has a sharp 
maximum at co 2 i = 0, which because of (50.7) is the same as 

hoo ~ Efi — En'> 

The other factors are relatively slowly varying; after the substitution 
(50.11) is made, they can be taken outside of the co integral, which 
can be written 

f “ sin^ ^co2i/ 1 

J — <X> 

Thus the absorption probability (60.10) becomes 

(50.12) 

which agrees with the corresponding expression (35.17) obtained in 
Chap. X. 




m'cw 


-1/ 


Wn(exp ^k • r)Ekx * grad Wn^ drl 
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Transition Probability for Emission. The energy relation (60.7) 
implies that the electronic state n has a higher energy than the state n'. 
We can now find the transition probability per unit time for emission of a 
light quantum when the electron goes from state n to state n'. The 
aiix* term in (50.4) is involved, and the matrix element analogous to (60.9) 
is 

^ 3 + 1) ^ j irv'[exp ( • r)]e kx • grad Wn dr, (50.13) 

where Wkx is the number of light quanta initially present in the electro- 
magnetic field state k,X. A calculation similar to that which led from 
(50.9) to (50.12) gives for the emission probability 

I J i5„.[oxp (-tk • r)]ekx • grad dr * 

+ ^ (-■ ifc • r)]ck;> • grad dr 

kX 

These two terms evidently arise from rikx and from 1, respectively, 
in the factor (w-kx + 1)^ of (50.13). The first term is proportional to the 
intensity of the incident radiation, and agrees with the expression (36.19) 
for the indiK^ed emission probability. The second term is independent 
of the intensity of the radiation present initially; we now show that it 
agrees with the spontaneous emission probability obtained in Chap. X. 

Wc can simplify the second term in (50.14) by replacing the summa^ 
tion over k with an integral over co or co 2 i. In order to do this, we require 
an expression for the number t)f states of the electromagnetic field that lie 
in the anguhir-fr(M|uen(*.y range to to w + dw. It follows from the discus- 
sion of periodic, boundary condil.ions in Sec. 11 [see Eqs. (11.3)] that there 
are (L/2Ty(ik/lkydk, plane waves with propagation vectors in the range 
dkjikydks. Thus if we spe(ufy the direction of the propagation vector k 
by the polar angh^s with resi')ec;t to some fixed set of axes, the number 
of plane waves with angular frequenciy in the range dca that have direc- 
tions of propagation in the angular range ddd<l> is 

sin dd6d^d(j3. 

The integration over co in the second term of (60.14) can be carried out as 
before; the result is 

w“n'[cxp (— tk * r)]£kx • gi^ad Wndr 
X 


Jo Jo L4 27rmV-^ 


sin^ 

coiii 


(60.14) 


sin 6d6d<i) (60.15) 
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The. expression to the right of the summation in (50.15) is the prob- 
ability. per unit time for spontaneous emission of a light quantum with 
propagation vector k in the angular range ddd(i>^ and polarization X. Thus 
(50.15) is the total spontaneous emission probability associated with the 
transition n In order to compare it with the corresponding expres- 

sion in Sec. 36, we specialize to the dipole case by replacing exp (— tk • r) 
by unity and grad by — (mw/A)r in the integrand of the matrix element 
[see Eq. (35.20)]: 

2 

sin dddd<t> (60.16) 

X 

The two directions of polarization for each propagation vector can be 
chosen in any way so long as they are perpendicular to each other and to 
k- If we choose one in the plane of k and the matrix element (r)n'n, and 
the other perpendicular to this plane, only the former is emitted; then a 
factor sin® 6 appears in the integrand, where 6 is the angle between k and 
(r)„^n. Thus the polarization and angular distribution of the emitted 
radiation are in agreement with those found in Sec. 36. The total spon- 
taneous dipole emission probability obtained from (50.16) is 


io jo h 27 rhc^ ^ J 


/iWndTl 


(/ Wn'tWndr * J Wri»tv)nd^ sin BdBd^) 


4e®a}® 


l(r)„..|® (50.17) 


which agrees with (36.22). 

Both the spontaneous and induced emission probabilities appear as 
the result of a single calculation in quantum electrodynamics, while these 
two effects were calculated in quite different ways in Chap. X. As was 
pointed out above, the spontaneous emission arises from the 1 in the 
factor (wkx + 1)^ that appears in the second of Eqs. (48.34). This in 
turn comes from the commutation relations (48.28), and hence is a 
purely quantum-mechanical effect. From a formal point of view, we can 
say that the spontaneous emission probability is equal to the probability 
of emission that would be induced by the presence of one quantum in 
each state of the electromagnetic field (see Prob. 4, Chap. X). Now we 
have already seen in Eq. (48.32) that the smallest possible energy of the 
field corresponds to the presence of one-half quantum per state. This 
suggests that we regard the spontaneous emission as being induced by 
the zero-point oscillations of the electromagnetic field; note, however, 
that these oscillations are twice as effective in producing emissive transi- 
tions as are actual quanta, and are incapable of producing absorptive 
transitions. 
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Analysis of a Diffraction Experiment. As our final example, we con- 
sider the diffraction experiment that was discussed in Sec. 2,^ It is 
desirable to simplify the calculation as much as possible, and we start by 
reducing the arrangement of Fig. 1 to its essentials. These essentials 
consist of a source of light S, a diaphragm A with two slits cut in it, and a 
detector of light that can be placed at various points in the plane B. 
The source is taken to be a single atom that is in an excited state and can 
radiate a light quantum. The detector is taken to be a different kind of 
atom in its ground state, which becomes ionized if it absorbs the light 
quantum given off by the source atom (photoelectric effect). The 
diaphragm is assumed to be made of perfectly reflecting material; its 
atomic structure is ignored, and it simply imposes certain boundary 
conditions on the electromagnetic field. 

The physical process in which we are interested is that in which the 
initially excited source atom emits a light quantum, and the detector 
atom absorbs the quantum and is ionized. Actually, however, the light 
quantum cannot be observed in transit unless an arrangement like that 
of Fig. 2 is used, and we are not interested in such an experiment. What 
we wish to calculate is the transition probability from a state in which the 
source atom is excited, the detector atom is in its ground state, and no 
quanta are present, to a state in which the source atom is in its ground 
state, the detector atom is ionized, and no quanta are present. This 
tells us how the probability of observing ionization depends on position in 
the plane B when a light source is placed at S, We shall see that this 
probability is proportional to the intensity calculated by classical electro- 
magnetic theory when a light source is at 8, In this way, quantum 
electrodynamics predicts both the diffraction pattern that is characteristic 
of light waves, and the ejected photoclectrons that are characteristic of 
light quanta. 

The form of the Hamiltonian [hkjs. (50.2) and (50.3)] tells us that elec- 
trons on different atoms interact with each other only through the cou- 
pling between each electi*on and the electromagnetic field. Thus the 
process with which we are concerned occurs only in second order of the 
perturbation energy //' of (50.2).^ Since no quanta are present initially 
or finally, the intermediate states are those for which a single quantum is 
present, and either both atoms are in their ground states or the source 

^ See also G. llaoah, Accad. Lincei Rand.f 11, 837, 1100 (1030); W. Heisenberg, 
Ann. d. Physik, 9, 338 (1031); E. Fermi, op. di.y Sec. 10. 

* We aBsnmo that the BpontiinconB radiation probability of the aonreo at.oin is small 
enough so that the remarks of footnote 1 , page 192, are applicabh^. Then the over-all 
transition probability per unit time is constant for times that are long enough so that 
energy is conserved between initial and final states. 
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atom is excited and the detector atom is ionized. The jSxst type of 
intermediate state corresponds to a double process in which the source 
atom makes a transition to its ground state and emits a light quantum, 
and the detector atom makes a transition to its ionized state and absorbs 
this quantum. Since the intermediate state need not conserve energy, 
the light-quantiim energy does not have to equal the initial excitation 
energy of the source atom (see Sec. 29). The second type of intermediate 
state corresponds to a double process in which the detector atom makes a 
transition to its ionized state and emits a light quantum, and the source 
atom makes a transition to its ground state and absorbs this quantum. 
In this case, it is apparent that the intermediate state cannot conserve 
energy. The second-order matrix element for the over-all process is 
obtained by summing expressions like (29.20) over all possible intermedi- 
ate light-quantum states of both types. 

The structure of the calculation that follows does not depend on the 
particular experimental arrangement of Fig. 1. The actual diffraction 
pattern is not found explicitly; instead it is shown that the quantum and 
classical results agree for any arrangement of perfectly reflecting dia- 
phragms, with or without slits. This conclusion is not surprising, since 
Maxwellh equations have the same form in classical and quantum electro- 
dynamics. Nevertheless, it is interesting for us to see explicitly in what 
way agreement is obtained. It is shown below that the summation over 
intermediate light-quantum states gives an expression that is equivalent 
to the point-source solution (Greenh function) of the electromagnetic 
wave equation. 

Representation of the Electromagnetic Field. The expansion of the 
electromagnetic field in plane waves that was introduced in Sec. 48 and 
used earlier in this section is not suitable for the present problem, since 
plane waves do not satisfy the proper boundary conditions at the surface 
of the diaphragm. The functions that do satisfy these boundary con- 
ditions are quite complicated, and we do not attempt to find explicit 
expressions for them. We simply assume that they exist and form a 
complete set in terms of which the vector potential can be expanded. 
The assumption that the surface of the diaphragm is perfectly reflecting 
means in the general case that these functions have to be real. We make 
the set discrete by placing the entire system in a large but finite closed 
box that has perfectly reflecting walls. The cartesian components of 
the vector functions Uifc(r) satisfy the second-order wave equation 

+ -^ Uks = Oj s = Xyy,z (60.18) 
c 

where div = 0. Since the tangential components of the electric field 
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and the vector potential vanish at a perfectly reflecting surface, the 
boundary condition is 

n X Ufc = 0 (50.19) 

where n is the vector normal to the surface of the diaphragm or to the 
walla of the box. 

We now show that two of these functions that correspond to different 
eigenvalues 03k are orthogonal to each other. We multiply (60.18) by 
the corresponding equation for Uk'^ir) by Ufc«(r), take the difference 
of the two results and sum over s = XjyjZ. The result can be written, 
with the help of Green’s theorem, 

/ X (“•■• ^ '-w) I 

a^x,y,z 

where {d/dn) denotes the component of the gradient in the direction of 
the outward normal. The surface integral on the left extends over both 
the diaphragm and the walls of the box. According to (60.19), the 
tangential components of Ua: vanish on the boundary surfaces; this implies 
that the tangential derivatives of these components are also zero. Then 
since div u* = 0, the normal derivative of the normal component of 
Ufc vanishes. It folloAvs that at the bounding surfaces, Ua, is a vector 
perpendicular to the surfac^e, and dUk/On is a vector parallel to the sur- 
face, so that their s(^alar produc’-t is zero. Thus the surface integral on the 
left side of (60.20) vanislics, and Ju// ■ UA,dr = 0 if 9 ^ co/fe. Any degen- 
erate solutions of (60.18) can bo c.hoson orthogonal to each other, and all 
solutions can be normalized in the entire region, so we can put 

Ju/,' ‘ MhdT = dkk> (50.21) 

We now proc(^ed as in Rec. 48 tio expand A and P for the vacuum field 
in terms of the functions u*,: 

A(r,/) = y ( 7 *(/.)u*(r), P(r,0 = ^ p*(0u*(r) (60.22) 

k k 

where qk and Pk are Ihu’mitian and satisfy the commutation relations 

WkiOj “ ihSkk' TfiO 

fo(/), qkiO] = \pk{l), Pk^U)] = 0 

Substitution of (50.22) into the (dectromagnetic field Hamiltonian (48,7) 
gives, with the help of (50.21), 
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The integral on the right can be simplified by means of a partial inte- 
gration: 

/(curl u*) • (curl Ui)dT = Juj, • curl curl Uj dr 

where the surface integral vanishes because of the boundary conditions. 
This integral can be simplified further if Uj is expressed in cartesian coordi- 
nates and use is made of Eqs. (50.18) and (50.21): 

/(curl Uj) • (curl u^dr = — /uj, • Vhitdr 

= ^ j Ui ■ U|dT = 

The field Hamiltonian then becomes 

^ ^2tc^ 2 -b ^ (60.24) 

k 

The quantum equations of motion for gh and pk are obtained from 
(46.23), (50.23), and (50.24): 

= 47rc*pi, 3* 

These are readily integrated to give 

Qh = -I- o]Ee““ 

PJ. = - ^ - o:Je'““) 


where a* and aj are operators that do not depend on the time. It is 
easily verified that the a's satisfy commutation relations like (48.28) : 




27rAc^ 

COfc 


dhk' 


with all other pairs commuting. Thus we can identify a* and aj with 
destruction and creation operators, respectively, for quanta in the state 
k of the electromagnetic field. The Hamiltonian (50.24) becomes 

^ + i), N, = ata, 

k 

This is in agreement with (48.32) since co* corresponds to the quantity ck 
in Sec. 48. 

Matrix Elements. The second-order matrix element (29.20) can be 
written as the sum of two terms that correspond to the two types of 
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(g;),o,(g^),>.x 
Ea\ — Ebu — huk 


(60.26) 


(H's) ofc.i is the matrix element for the transition of the source atom from 
the excited state of energy Esi to the ground state of energy Eso with 
emission of a quantum in the state k:^ 


(i?a)o*.i = ^ (^~) / ’»ao(rOui,(r') ■ grad' «)«i(r')dT' 

Qk is the matrix element for the transition of the detector atom from 
the ground state of energy Ebo to an ionized state of energy Eai with 
absorption of the quantum in the state k: 

^ f ,2)^<(r)u*(r) • grad Wflo(r)dT 

me \ / J 

The second term (uin similarly be written 



Too, A.0 

Ebo — Eiii — hciih 


(60.26) 


It is readily verified from the eai'lier work that (i¥a)o,ifc = and 

The timo-depentkmt perturbation theory of Sec. 29 tells us that the 
transition probability per unit time is appreciable only if energy is con- 
aeiwed bctwi^en initial and final states. Thus we ai'o interested in those 
ionized states for whi(di 


PJjii — ji?7io = Eai — Eho 


If we call tills energy dilTer(ui(*.c Aoi, the sum of (60,26) and (50.26) can 
be written 

The elccitronic. wave fmie.lions v> that appear in (60.27) are well 
localized around eitlier t.he sourcfe atom or the detector atom. Since we 
are primarily intere.,sted in macroscopic obaervations, we neglect the 

' As was (lone earlier in lliis sec.tion, we ehaiiRe lo a representation in which the 
time dcpenrlencn appcMira in the wave fiiuetimials rather than in the operators, since 
this lends itself more readily to uixplieal.ion of the time-dependent perturbation theory. 
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spatial extent of these wave functions. The transition probability then 
depends on the positions r' of source and r of detector through the expres- 
sion in square brackets. For a large contaming box, the summation over 
fc is to be replaced by an integration over cojt, where the contour C is 
chosen in accordance with Eq. (29.24). We define a density of states 
p(A;), such that p(A)do3jfc is the number of states of the electromagnetic field 
in the angular-frequency range doj*. We can then put 

[ ] "" ^P + iR (60.28) 

where P is the principal value of the integral and is tt times the residue 
of the integrand at the pole oja = w. It is apparent that P and R are real. 
The transition probability is proportional to the square of the magnitude 
of (60.28), or to 

P^ + R^ (50.29) 

Classical Diffraction Pattern. We have now to compare this result 
with the classical expression for the intensity of light produced at r by a 
source at r'. This can be found from a solution of the wave equation 
(35.9) for the vector potential A(r,^) that is produced by a current density 
Jiht): 

(50.30) 

We are interested in a solution of Eq. (50.30) when the source current J 
is small in spatial dimensions and oscillates sinusoidally in the time. It 
will be apparent from what follows that an analytical difficulty is encoun- 
tered if the time dependence of J is purely harmonic. We therefore make 
the physically plausible assumption that J represents a damped oscilla- 
tor, and later take the limit in which the damping constant is negligibly 
small. We put 

J(r,0 = cos 0)1 

= ■s-J(r)[e^“'’'‘^^' -t- 

Since Eq. (50.30) is linear, its steady-state solutions have the form 

A(r,0 = + ■jA(r)e^"''^^' (50,31) 

where A(r) satisfies the equation 

V»A(r) + i (o, + i 7 )*A(r) = - ^ J(r) (50.32) 

c c 

Equation (50.32) can be solved in the same way as was the inhomo- 
geneous wave equation (26.7). We expand A(r) in terms of the complete 
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set U)i,(r) of real vector functions that satisfy the boiindaiy conditions 
stated earlier: 

A(r) = ^ i4*u*(r) (60.33) 

k 

The constant expansion coefficients may be complex since A(r) is not 
necessarily real [the vector potential A(r,i) is of course real]. Substitu- 
tion of (50.33) into (50.32) gives, with the help of (50.18), 

- ^ ^ A*[to| - (w + i-Y)“]Uft(r) = - ^ J(r) 


This can be solved for A* with the help of the orthonormality relation 
(50.21). 

Substitution of (60.34) into (60.33) gives for a caiiesian component of 
A(r) 

A.(r) = 47rc J ^ ./,.(r') 

The summation in square brackets can be replaced by an integration 
over real values of wjt without difficulty, since for finite 7 the pole of the 
integrand is above the real axis. Tii the limit 7 0, the pole moves into 

the real axis, and th(5 int(^gi’al approaches the value computed with the 
contour C of K{\. (2U.24). This is just the integral (50.28), so that we 
can replaces tlic sciuan^brac-ket expression in (50.36) by P-f iR. The 
intensity of light uK^asured at r is proportional to the time average of the 
square of tlu^ vector potcuitial (50,31). For a small current source 
located at the point r', this intensity is proportional (in the limit 7 -^ 0 ) 
to 

{[(P + /'Pje'**’' + {P — = 4[(P cos 03 t — R sin 

= 2(P2 + P2) 

The agreement h(d.we(ui l.liis result and (50.21)) shows that the probability 
of finding an ioiiizcul atom at a pn,rti(nilar point is proportional to the 
classically computed intensity of light at that point. 




Problems 

1. Calc.iiliitr tho c.oninuiUilor hmeke^t of [curl A(r,01* and Pa(rV). Use the 
result tA) hIiow tliiil, tlio ociuation of motion for the quantity P in a vacuum electro- 
magnetic field \h given by the Hccond of K(ih. (48.0). 
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t 

2. Show that if ihe commutator bracket of two cartesian components of A and P 
is given by 

[A,(T,t), p..(r' i)] = - r-) - g ^ (^) 

then A and div P at different space points commute with each other. 

8. Show that the replacement of the last of the commutation relations (48.8) by 
that of Prob. 2 does not affect those commutator brackets that involve the field 
strengths. 

4. Show that div H commutes with the electromagnetic field Hamiltonian (48.16), 
and hence is a constant of the motion. 

6. Carry through the calculations implied in Eqs. (48.16), and show that the 
first two of Maxwell^s equations are obtained. 

6. Calculate the commutator bracket of each of the bracket expressions in (48.18) 
and the Hamiltonian (48.22), and show that the bracket expressions are constant in 
time. This means that these commutation laws are consistent with the equations 
of motion. 

7. (Dompare the D function given in Eqs. (47.30) and (47.33), and the Do function 
given in Eq. (48.39), from a physical point of view. 

8 . Derive the expression (48,43) for the commutator bracket of EB(r,0 and 
Hv(r'/). 

9. Obtain the Lagrangian equations of motion for and A from Eq. (49.4). 

10. Find the quantum equation of motion for ^ when the Hamiltonian is given by 
(49.11) with the substitution of (49.13) for (49.12). Show that it agrees with (49.1) 
if is replaced by c/[^*(r',0^(r',i)/lr r'l]dr'. What result is obtained if the above 
subsitution is not made? 

11. Show that the bracket expressions in (47.8) and (48.21) are constant in time 
for the Hamiltonian of Prob. 10. 

12. Verify Eq. (49.23) by making use of the anticommutation relations (49.19), 
and also by making use of operator equations like (46.32). 

13. Calculate a typical off-diagonal matrix element of the Coulomb interaction 
energy (49.21), and show that it is in agreement with what would be obtained from on 
antisymmetric many-electron wave function like (32.7). 

14. Carry through a calculation like that given in the latter part of Sec. 50, except 
that the diaphragm is omitted and plane waves with periodic boundary conditions are 
used in place of the real vector functions Uife(r). Show explicitly that the probability 
of ionization of the detector atom is inversely proportional to the square of its dis- 
tance from the source atom. Compare this calculation with that of Sec. 30 which 
dealt with the production of a cloud-chamber track. 
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